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Attorney Docket No.: 06504 -14882 -088001 

NO¥E^CYTIDINE DEAMINASE 

[001] This application is a continuation-in-part of PCT/JP00/01918, filed March 28, 2000, 
5 and claims priority from Japanese Patent Application No. 1 1-87192, filed March 29, 1999; 
Japanese Patent Application No. 1 1/178999, filed June 24, 1999; and Japanese Patent 
Application No. 11/371382, filed December 27, 1999. 

FIELD OF INVENTION 
[002] The present invention relates to novel proteins having a cytidine deaminase activity; 

10 DNAs and fragments thereof (cDNAs, genomic DNAs, and primer DNAs) encoding the 
proteins; expression vectors comprising the DNAs; transformants transformed with the 
expression vectors; antibodies reactive to the proteins or fragments thereof; cells producing 
the antibodies; and methods for identifying substances that regulate production of the 
proteins, transcription of genes encoding the proteins into mRNAs, or enzyme activities of 

15 the proteins.T 

BACKGROUND 

[003] The germinal center of mammals comprises a highly specialized microenvironment 
required for the final process of maturation towards antigen specific memory cells and long- 
lived plasma cells (Embo J., 16:2996-3006, 199; Semin. Immunol., 4:11-17, 1992). In the 

20 this microenvironment. it is known that two fundamental editings of the immunoglobulin 
genes take place (J. Exp. Med., 173:1165-1175, 1991; Embo. J., 12:4955-4967, 1993; Adv. 
Exp. Med. Biol., 186:145-151, 1985; Nature, 342:929-931, 1989; Cell, 67:1121-1129). 
[004] The first fundamental editing is One is the somatic hypermutation (Curr. Opin. 
Immunol, 7:248-254, 1995; Annu. Rev. Immunol., 14:441-457, 1996; Science, 244:1152- 

25 1 1 57, 1 989), a phenomenon in which extensive point mutation in the exons of exen-genes 
encoding variable regions of immunoglobulins occurs. Accumulation of4he point mutations 
leads to selection of B cells expressing high affinity immunoglobulins on their cell surface, 
accompanied by the affinity maturation of antibodies (Embo. J., 4:345-350, 1985; Proc. Natl. 
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Acad. Sci. USA, 85:8206-8210, 1988). As the-ajresult, immunoglobulin genes are edited as 
new functional genes. 

[005] Anothe r fundamental editing process is the class switch recombination (CSR). In the 
r e combination CSR , effector functions of antibodies, such as complement fixation, are 
5 selected by exchanging exons encoding constant regions of immunoglobulin heavy chains 
(Curr. Top. Microbiol. Immunol., 217:151-169, 1996; Annu. Rev. Immunol, 8:717-735, 
1990). 

[006] These two types of genetic editing are very important for effective humoral 
immunoreaction to eliminate harmful microbes. The molecular mechanisms of the genetic 
10 phenomena have not yet been elucidated despite the-extensive studi e s study for several 
decades. 

[007] The present inventors isolated amouse B cell clone, CH12F3-2, as a research tool to 
elucidate the molecular mechanism of class switch recombination of immunoglobulin. In the 
this B cell line, class switch recombination (CSR) from IgM to IgA begins several hours after 
15 stimulation with IL-4, TGF-0, and CD40L;^and finaH vultimatelv , over 80% of the cells 
become IgA positive (Immunity, 9:1-10, 1998; Curr. Biol., 8:227-230, 1998; Int. Immunol., 
8:193-201,1996). 

[008] Using the mouse B cell clone CH12F3-2, the present inventors had- previouslv 
reported that the breakpoints of CSR distribute not only in the switch region ( or " S region"), 

20 characte rizedi stie by repeated sequences, but also in the-neighboring sequences (Curr. Biol., 
8:227-230, 1998). However, the breakpoints were rarely seen in I exon and C exon, which 
are locateding-at upstream and downstream of the S region, respectively. Also, according to 
the-accumulated scientific evidence, it has been shown that the-transcription of I exon and 
C exon and the-splicing of the transcripts are essential for CSR (Cell, 73:1155-1164, 1993; 

25 Science, 259:984-987, 1993; Proc. Natl. Acad. Sci, USA, 90:3705-3709, 1993; Cell, 81:833- 
836, 1995). 

[009] This nam e ly suggests that the transcripts are involved in CSR either directly or 
indirectly. Accordingly, the present inventors propose a theory that class switch is initiated 
by the recognition of DNA-RNA complex structure and not by the recognition of nucleotide 
30 sequences of the_switch region. This idea is further fortified by the fact that even when the m 
th e cas e that Sa region is substituted with an_Sa region or anjSy region by introducing a mini- 
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chromosome into thet e above-mentioned mouse B cell clone CH12F3-2, CSR in the mini- 
chromosome efficiently occurs after stimulation with b y stimuli of cytokines (Immunity, 9:1- 
10, 1998). 

[0010] In plants and Pprotozoa, RNA editing, another type of genetic editing, is widely used 
5 as a mean for producing functional genes from ajimited genome (Cell, 81 :833-836, 1995; 
Cell, 81 : 837-840, 1995). mRNA editing of many molecules such as the mRNAef-for 
apolipoprotein B (apoB), AMPA receptors, Wilmstumor-1, aa-galactosidase and 
neurofibromatosis type-1, and tRNA-Asp, have been reported (Trends Genet., 12:418-424, 
1996; Curr. Opin. Genet. Dev., 6:221-231, 1996). Although the molecular mechanism of 
1 0 mammalian RNA editing has not yet been elucidated, one performed by APOBEC-1 

(apolipoprotein B mRNA editing enzyme, catalytic polypeptide- 1) is becoming b e comes 
understood by degrees (Science, 260:1816-1819, 1993; J. Biol. Chem., 268:20709-20712, 
1993). 

[0011] In apoB RNA editing, the first base C (cytosine) of codon CAA, which encodes 

15 glutamine, is converted to U (uridine), which alters the codon to UAA. As athe-result, an in- 
frame stop codon is made in the apoB mRNA (J. Cell., 81:187-195, 1995; J. Cell., 50:831- 
840, 1987; Science, 238:363-266, 1987). apoB-48 and apoB-100 are transcripts of edited 
mRNA and unedited mRNA of apoB, respectively, and these proteins possess totally different 
physiological functions fromer each other (J. Biol. Chem., 271:2353-2356, 1996). 

20 [0012] In the-site-specific RNA-editing, auxiliary factors are required (Science, 260:1816- 
1819, 1993; J. Biol. Chem., 268:20709-20712, 1993). In the absence of auxiliary factors, 
APOBEC-1 shows only a cytidine deaminase activity, possessing non-specific low affinity to 
RNA (J. Biol. Chem., 268:20709-20712, 1993; J. Cell., 81:187-195, 1995; J. Biol. Chem., 
270:14768-14775, 1995; J. Biol. Chem., 270:14762-14767, 1995). The expression and 

25 activity of the auxiliary factors are found not only in organs with apoB mRNA editing, but 
also in organs with undetectable levels of APOBEC-1 expression, or organs without apoB 
mRNA editing (Science, 260:1816-1819, 1993; J. Biol. Chem., 268:20709-20712, 1993; 
Nucleic Acids Res., 22:1874-1879, 1994; Proc Natl. Acad. Sci, USA, 91:8522-8526, 1994; J. 
Biol. Chem., 269:21725-21734, 1994). 

30 [0013] The unexpected expression of the auxiliary factors involved in apoB mRNA editing 
suggests that the auxiliary factors may be involved in more general cellular functions or other 
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yet unknown RNA editing. Since th e r e ar e possibiliti e s the possibility exists that CSR and 
hypermutation, which are involved in g enetic editing of r e lating to immunoglobulin genes , 
may be accomplished by RNA editing, it is would be very interesting to elucidate whether 
RNA editing takes place or not in the genetic editing of immunoglobulin gene s as mentioned 
5 above. 

SUMMARY 

[0014] The present invention provides AED (Activation-Induced cytidine Deaminase), a 
novel cytidine deaminase haviftfr -that is structural ly related r e lationship to APOBEC-1, one 

10 e£an RNA editing enzyme, and ismvolved in RNA editing in germinal center B cells, where 
genetic editing of immunoglobulin genes occurs, and a-DNA encoding the new enzyme. 
[0015] The present inventors intensively searched for novel genes involved in class switch 
recombination (CSR), one of the major types of g enetic editing of immunoglobulin genes. 
As a result, by preparing cDNA libraries for the mouse B cell clone CH12F3-2; (in which 

15 class switch recombination from IgM to IgA is shown to occur at an extremely high rate upon 
tog e th e r with activation of the cells by stimulation with cytokines}, with and without 
stimulatiorpg with cytokines, and performing subtraction cloning using the libraries, the 
present inventors found genes encoding mouse- and human-derived novel proteins named 
AID (Activation-Induced cytidine Deaminase), having a_structural relationship to APOBEC- 

20 1, one of the RNA editing enzymes, and having a cytidine deaminase activity similar to 
APOBEC-1 . 

[0016] The AID protein in the present invention possesses features described below, and is 
considered to be a very important RNA-modifying deaminase involved in regulating B cell 
activation, CSR of immunoglobulin genes, somatic hypermutation, and affinity maturation, 
25 which aH-are all involved in g enetic editing specific to germinal center function: 

(1) The ORF of the cDNA encoding the AID protein comprises 198 amino 
acids, with a_24kDa calculated molecular weight (mouse: SEQ ID NO:2, and human:. 
SEQ ID NO:8). Meuse -The mouse AID protein shows an approximately 28kDa 
molecular weight by SDS-PAGE. 
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(2) The amino acid sequence of the AID protein is 34% and 26% identical 
to APOBEC-1 (apolipoprotein B mRNA editing enzyme, catalytic polypeptide- 1) at 
the amino acid sequence level, for mouse and human derived protein^ respectively. 

(3) The AID protein has a_cytidine/deoxycytidine deaminase motif, which 
is the active center of the deaminase activit y whiehand is conserved in amino acid 
sequences of proteins belonging to the cytosine nucleoside/nucleotide deaminase 
family. 

(4) The cG vtidine deaminase motif of the AID protein is allied with an 
RNA editing deaminase subgroup. 

(5) The AID protein has_a feleucine-rich region considered to be important 
in protein-protein interaction, similar to APOBEC-1 . Four leucines in this leucine- 
rich region of the AID protein are conserved in the leucine-rich region of APOBEC-1 
in rabbit, rat, mouse and human. 

(6) In the primary structure of the AID protein, all of the amino acid 
residues reported to be necessary for APOBEC-1 to bind RNA(Phe66, Phe87, His61, 
Glu63 and Cys93) are conserved. 

(7) The AID protein has_a pseudoactive site domain in its C terminal for 
forming homodimers, similar to APOBEC-1 and ECDDA, an E. coli derived cytidine 
deaminase. There is a ar e possibiliti e s possibility that the AID protein forms 
homodimers, or associates with other auxiliary proteins. 

(8) The AID protein shows a concentration-dependent cytidine deaminase 
activity. The activity can be inhibited dose dependently by tetrahydrouridine (THU), 
a specific inhibitor of cytidine deaminase. Also, a zinc chelator, 1,1 0-o- 
phenanthroline, inhibits the cytidine deaminase activity of the AID protein while 1,7- 
o-phenanthroline, the inactive isomer, shows a weak inhibition. Thus, the AID 
protein can considered to be a zinc-dependent cytidine deaminase,, as is_APOBEC-l. 

(9) Strong expression of AID mRNA e xpr e ssion is seen in lymph nodes 
(mesenteric and amygdaline). Also, weak expression in spleen is seen. 

(10) mRNA eExpression of AID pretemmRNA is seen in a variety of 
lymphoid tissues (Peyer's patches, mesenteric lymph node, axillary lymph node, 
spleen, and bone marrow). Especially^ notable expression is seen in peripheral 



lymphoid organs, such as lymphatic nodes and Peyer's patches. Contrariwis e , In 
contrast, expression in primary lymphoid organs is lower than the peripheral 
lymphoid organs. 

(11) Expression of AID mRNA is at the lower limit of detection limit l e v e l 
without cytokines (IL-4, CD40L, TGF-p) stimulation in mouse B cell clone CH12F3- 
2, in which the cytokines stimulate class switch from IgM to IgA in the cells.^ 
wh e r e as th e . eExpression is induced 3 hours after stimulation, and maximal 
expression is seen after 12 hours, with cytokine stimulation. 

(12) AID mRNA expression in mouse B cell clone CH12F3-2 can be 
induced more strongly when stimulated with all three cytokines, IL-4, CD40L and 
TGF-p, simultaneously, rath e r than stimulat e d w ith any one of them alone . Also, it 
can be considered that de novo protein synthesis is necessary for augmentation of 
AID mRNA expression, as the AID mRNA expression induction by cytokines in 
mouse B cell clone CH12F3-2 can be inhibited by cycloheximide, an protein 
synthesis inhibitor. 

(13) In the-zn vitro tests, an augmentation of AID mRNA expression can be 
seen when normal mouse spleen B cells are stimulated with LPS alone, LPS+IL-4, or 
LPS+TGF-p. 

(14) In the-m vivo tests, when normal mice are immunized with sheep red 
blood cells (SRBC), a significant augmentation of AID mRNA expression can be seen 
5_days after immunization, in which SRBC are known to induce clonal expansion, 
germinal center formation, and class switch recombination and affinity maturation of 
immunoglobulin genes. 

(15) The in vivo augmentation of AID mRNA expression by SRBC 
immunization is specifically seen in splenic CD19 positive B cells. 

(16) AID mRNA expression in lymphoid organs is specifically seen in the 
germinal center, enriched with B cells activated by antigen stimulation. 

(17) The Hhuman AID gene ieeates- is located at locus 1 2p 1 3 , close to the 
locus 12pl3.1, where the APOBEC-1 gene is leeates located . 

[0017] According to the characteristics described above, the AID protein of the present 
invention can be considered to have a function of regulating various biological mechanisms 



required for generation of antigen-specific immunoglobulins (specific antibodies), which 
eliminate non-self antigens (foreign antigen, self-reacting cells, etc.) that triggers various 
diseases. The mechanism for generation of immunoglobulin having high specificity to 
antigens includes germinal center functions such as activation of B cells, class switch 
5 recombination of immunoglobulin genes, somatic hypermutation, and affinity maturation. 
The AID protein of the present invention can be considered to be one of the enzymes that 
play an important role in the genetic editing occurring in germinal center B cells (e.g. class 
switch recombination and somatic mutation). 

[0018] The dysfunction of the AID protein of the present invention can be the cause for th eof 
10 humoral immunodeficiency since it induces failure of germinal center B cell function, such 
as antigen-specific B cell activation, class switch recombination, and somatic mutation. 



autoimmune disease since it can cause inappropriate B cell activation and needless class 
switch recombination and somatic mutation. 

15 [0019] Therefore, regulation of the function of AID protein and the gene encoding it enables 
pr e v e nting prevention and tr e ating treatment of various immunodeficiencies, autoimmune 
diseases, and allergies, which result from, for example, B cell dysfunctions (e.g. A 
IgA deficiency, IgA nephropathy, y globulinemia, hyper IgM syndrome, etc.) or class switch 
deficiency of immunoglobulin. Thus, the AID protein and the gene encoding the AID protein 

20 can be targets for the development of drugs for therapy of diseases mentioned above. 

[0020] Examples of diseases whose onset prevention, symptom remission, therapy and/or 
symptomatic treatment effect is expected by regulating the function of the AID protein of the 
present invention or the gene encoding it include, for example, primary immunodeficiency 
syndrome with congenital disorder of immune system, mainly various immunodeficiencies 

25 considered to develop by B cell deficiency, decrease, or dysfunction (e.g., sex-linked 

agammaglobulinemia, sex-linked agammaglobulinemia with growth hormone deficiency, 
immunoglobulin deficiency with high IgM level, selective IgM deficiency, selective IgE 
deficiency, immunoglobulin heavy chain gene deletion, k chain deficiency, IgA deficiency, 
IgG subclass selective deficiency, CVTD (common variable immunodeficiency), infantile 

30 transient dysgammaglobulinemia, Rosen syndrome, severe combined immunodeficiency 
(sex-linked, autosomal recessive), ADA (adenosine deaminase) deficiency, PNP (purine 




•Conversely, the hyperfunction of the AID protein may induce allergy disease or 
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nucleoside phosphorylase) deficiency, MHC class II deficiency, reticular dysplasia, Wiskott- 
Aldrich syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal aberration, 
familial Ig hypermetabolism, hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, 
Good syndrome, osteodystrophy, transcobalamin syndrome, secretory bead syndrome, etc.), 
5 various diseases with antibody production deficiency that are secondary immunodeficiency 
syndromes with a_disorder of immune system caused by an acquired etiology (for example, 
AIDS, etc.), and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis, 
conjunctivitis, allergic rhinitis, allergic enteritis, drug-induced allergy, food allergy, allergic 
urticaria, glomerulonephritis, etc.). 

10 [0021] Nam e ly, t The AID proteins of the present invention, a fragment thereof, a DNA 
encoding the AID protein, a fragment thereof, and an antibody against the AID protein are 
useful as reagents for developing drugs for prevention and therapy of such diseases. 
[0022] Also, the DNA itself is useful as an antisense drug regulating the function of the AID 
gene at a gene level and in a us e in gene therapy The protein or the fragments thereof (e.g. 

15 enzyme active site) its e lf is themselves are useful as a-drugs. 

[0023] Furthermore, a DNA comprising a compl e m e ntary nucleotide sequence that is 
complementary to an arbitrary partial nucleotide sequence in the nucleotide sequence of 
genomic DNA encoding AID protein of the present invention (especially human AID protein) 
is useful as a primer DNA for polymerase chain reaction (PCR). 

20 [0024] An arbitrary partial nucleotide sequence of genomic DNA encoding the AID protein 
(especially human AID protein) of the present invention can be amplified by PCR using the 
primer DNA pair. For example, in the case that mutation or deletion of the nucleotide 
sequence of genomic DNA (especially exon) encoding AID protein is presumed to cause a 
certain immunodeficiency or an allergy, mutations and deletions in the genomic DNA can be 

25 identified by amplifying an arbitrary partial nucleotide sequence of genomic DNA encoding 
the AID protein obtained from tissue or cells of immunodeficiency or allergy patients by 
PCR using a pair of fee-primer DNAs, by analyzing the presence and the size of PCR 
products T and the nucleotide sequence of the PCR products, and by comparing the nucleotide 
sequence with the corresponding nucleotide sequence in the genomic DNA encoding the AID 

30 protein derived from the-anormal human. That is to say, this method is capable of not only, 
for example, elucidating relationships between immunodeficiency or allergy and AID 
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protein, but also, in the case where - the ADD protein is the cause of onset of a sort of disease 
(e.g. immunodeficiency and/or allergy), diagnosing the diseases by the methods mentioned 
above. 

[0025] Furthermore, an antibody reactive to the AID protein of the present invention or a 
5 fragment thereof is extremely useful as an antibody drug by regulating functions of the AID 
protein. 

[0026] Furthermore, the gene (DNA), protein, and antibody of the present invention are 
useful as reagents for s e arching searching for substrates (e.g. RNA, etc.) that interacting 
(binding) with the protein (enzyme) of the present invention, or other auxiliary proteins 
10 associated with the protein of the present invention, and for developing drugs targeting the 
substrates and auxiliary proteins. 

[0027] Also, model animals can be generated by disrupting (inactivating) the AED gene 
based on the genetic information on the ADD protein derived from mammals (e.g. mouse, 
etc.), which is one embodiment of the DNA of present invention. By analyzing the physical, 
15 biological, pathological, and genetic features of the model animal, it is possible to elucidate 
functions of the genes and the proteins of the present invention. 

[0028] Furthermore, by introducing abnormal human AID gene or mutant human ADD gene 
(e.g. mutant human ADD genes derived from immunodeficiency patients), which is one 
embodiment of the present invention, into the model animal whose endogenous gene has 
20 been disrupted, model animals having only normal or mutant human ADD genes of the 

present invention can be generated. By administering drugs (compounds, antibodies, etc.) 
targeting the introduced human AID genes to the model animals, therapeutic effects of the 
drugs can be evaluated. 

[0029] Furthermore, a method for identifying a substance that regulates production of the 
25 ADD protein of the present invention or transcription of a gene encoding the ADD protein into 
mRNA, or a substrate that inhibits the enzyme activity of the ADD protein (e.g. cytidine 
deaminase activity) are-is_extremely useful as ameans to develop drugs for therapy and 
prevention of various diseases (especially, immunodeficiency and/or allergy) in which the 
above-mentioned ADD protein or ADD gene is considered to be involved. 
30 [0030] Thus, the present invention, for the first time, provides blew the below -mentioned 
DNAs (cDNAs, genomic DNAs, and an arbitrary fragment thereof), proteins, expression 
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vectors, transformants, antibody pharmaceutical compositions, cells, the use of the DNA 
fragments as primer DNAs, and methods for screening. 

(1) A DNA or a fragment thereof encoding a protein comprising the amino 
acid sequence of SEQ ID NO:2 or 8. 

(2) The DNA or the fragment of (1), wherein the protein has a cytidine 
deaminase activity. 

(3) A DNA or a fragment thereof comprising the nucleotide sequence of 
SEQ ID NO: 1 or 7. 

(4) A DNA or a fragment thereof comprising a nucleotide sequence of (a) 
or (b) below: 

(a) a nucleotide sequence comprising the nucleotide residues 93 to 
689 ofSEQIDNO:l or 

(b) a nucleotide sequence comprising the nucleotide residues 80 to 
676ofSEQIDNO:7. 

(5) A DNA or a fragment thereof of (a) or (b) below: 

(a) a DNA or a fragment thereof that hybridizes under stringent 
conditions with a DNA comprising the nucleotide sequence of SEQ ID NO:l and that 
encodes a mammal-derived protein being homologous to a protein that comprises the 
amino acid sequence of SEQ ID NO:2 and having a cytidine deaminase activity or 

(b) a DNA or a fragment thereof that hybridizes under stringent 
conditions with a DNA comprising the nucleotide sequence of SEQ ID NO:7 and that 
encodes a mammal-derived protein being homologous to a protein that comprises the 
amino acid sequence of SEQ ID NO:8 and having a cytidine deaminase activity. 

(6) A protein or a fragment thereof comprising the amino acid sequence of 
SEQ ID NO:2 or 8. 

(7) A protein or a fragment thereof comprising substantially the same 
amino acid sequence as that of SEQ ID NO:2 or 8 and having a cytidine deaminase 
activity. 

(8) A proteins of (a) or (b) below. 

(a) a mammal-derived protein that comprises an amino acid 
sequence encoded by a DNA hybridizing under stringent conditions with a DNA 
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comprising the nucleotide sequence of SEQ ED NO:l, that is homologous to a protein 
comprising the amino acid sequence of SEQ ID NO:2, and that has a cytidine 
deaminase activity^ or 

(b) a mammal-derived protein that comprises an amino acid 
sequence encoded by a DNA hybridizing under stringent conditions with a DNA 
comprising the nucleotide sequence of SEQ ID NO: 7, that is homologous to a protein 
comprising the amino acid sequence of SEQ ID NO: 8, and that has a cytidine 
deaminase activity. 

(9) An expression vector comprising the DNA or the fragment of any one 
of(l)to (5). 

(10) A trans formant transformed with the expression vector of (9). 

(11) An antibodyies or a portion thereof reactive to the protein of any one 
of (6) to (8) or to a fragment of the protein. 

(12) The antibodyies or the portion thereof of (11), wherein the antibody is 
a monoclonal antibody. 

(13) A pharmaceutical composition comprising the antibody or the portion 
thereof of (11) or (12), and a pharmaceutically acceptable carrier. 

(14) A cell producing a monoclonal antibody reactive to the protein of any 
one of (6) to (8) or to a fragment of the protein. 

(15) The cell of (14), wherein the cell is a hybridoma obtained by fusing, 
with a mammal-derived myeloma cell, a non-human mammal-derived B cell that 
produces a monoclonal antibody. 

(16) The cell of (1 5), wherein the cell is a transgenic cell transformed by 
introducing, into a cell, either or both of a DNA encoding a heavy chain of the 
monoclonal antibody and a DNA encoding a light chain of the monoclonal antibody 

(17) A genomic DNA or a fragment thereof comprising a nucleotide 
sequence of any one of (a) to (c) below: 

(a) SEQ ID NO:9, 

(b) SEQ ID NO: 10, or 

(c) SEQIDNO:35. 
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(18) A genomic DNA or a fragment thereof comprising a nucleotide 
sequence of any one of (a) to (e) below: 



(a) 


SEQ ID NO 


11, 


(b) 


SEQ ID NO 


12, 


(c) 


SEQ ID NO 


13, 


(d) 


SEQ ID NO 


14, or 


(e) 


SEQ ID NO 


15. 



(19) A DNA comprising a compl e m e ntary nucleotide sequence 
complementary to an arbitrary partial nucleotide sequence of a nucleotide sequence of 
any one of (a) to (h) below: 



(a) 


SEQ ID NO 


9, 


(b) 


SEQ ID NO 


10, 


(c) 


SEQ ID NO 


11, 


(d) 


SEQ ID NO 


12, 


(e) 


SEQ ED NO 


13, 


(f) 


SEQ ID NO 


14, 


(g) 


SEQ ID NO 


15, or 


(h) 


SEQ ID NO 


25. 



(20) The DNA of (19), wherein the DNA comprises a nucleotide sequence 
of any one of (a) to (q) below: 



(a) 


SEQ ID NO: 18, 


(b) 


SEQ ID NO: 19, 


(c) 


SEQ ID NO:20, 


(d) 


SEQIDNO:21, 


(e) 


SEQ ID NO:22, 


(f) 


SEQ ID NO:23, 


(g) 


SEQ ED NO:24, 


(h) 


SEQ ID NO:25, 


(i) 


SEQ ID NO:26, 


0) 


SEQ ID NO:27, 


(k) 


SEQ ID NO:28, 
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(1) SEQ ID NO:29, 
(m) SEQ ID NO:30, 
(n) SEQIDNO:31, 

(0) SEQ ID NO:32, 
(p) SEQ ID NO:33 ? or 
(q) SEQ ID NO:34. 

(21) Use of the DNA of (19) or (20) as a primer DNA in polymerase chain 
reaction. 

(22) Use of a pair of DNAs of any one of (a) to (n) below as primer DNAs 
in polymerase chain reaction: 

(a) a DNA comprising the nucleotide sequence of SEQ ID NO: 3 1 
and a DNA comprising the nucleotide sequence of SEQ ID NO:32, 

(b) a DNA comprising the nucleotide sequence of SEQ ID NO:20 
and a DNA comprising the nucleotide sequence of SEQ ID NO:22, 

(c) a DNA comprising the nucleotide sequence of SEQ ID NO:21 
and a DNA comprising the nucleotide sequence of SEQ ID NO:30, 

(d) a DNA comprising the nucleotide sequence of SEQ ID NO:24 
and a DNA comprising the nucleotide sequence of SEQ ID NO:25, 

(e) a DNA comprising the nucleotide sequence of SEQ ID NO:23 
and a DNA comprising the nucleotide sequence of SEQ ID NO:27, 

(f) a DNA comprising the nucleotide sequence of SEQ ID NO:23 
and a DNA comprising the nucleotide sequence of SEQ ID NO:28, 

(g) a DNA comprising the nucleotide sequence of SEQ ID NO:23 
and a DNA comprising the nucleotide sequence of SEQ ID NO:29, 

(h) a DNA comprising the nucleotide sequence of SEQ LD NO:26 
and a DNA comprising the nucleotide sequence of SEQ ID NO:27, 

(1) a DNA comprising the nucleotide sequence of SEQ ID NO:26 
and a DNA comprising the nucleotide sequence of SEQ ED NO:28, 

(j) a DNA comprising the nucleotide sequence of SEQ ID NO:26 
and a DNA comprising the nucleotide sequence of SEQ ID NO:29, 
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(k) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 
and a DNA comprising the nucleotide sequence of SEQ ID NO:28 5 

(1) a DNA comprising the nucleotide sequence of SEQ ID NO:34 
and a DNA comprising the nucleotide sequence of SEQ ID NO:29, 

(m) a DNA comprising the nucleotide sequence of SEQ ID NO:33 
and a DNA comprising the nucleotide sequence of SEQ ID NO:29, or, 

(n) a DNA comprising the nucleotide sequence of SEQ ID NO: 1 8 
and a DNA comprising the nucleotide sequence of SEQ ID NO: 19. 

(23) A method for identifying a substance that regulates transcription of a 
gene encoding an AID protein comprising the amino acid sequence of SEQ ID NO:2 
or 8 into mRNA, or production of the AED protein, the method comprising the steps 
of: 

(a) culturing, separately in the presence and the absence of the 
substance, cells producing the AID protein and 

(b) (i) comparing the level of the AID protein produced by the 
cells cultured in the presence of the substance with the level of the ATP protein 
produced by the cells cultured in the absence of the substance or 

(ii) comparing the level of the AID protein-encoding mRNA 
transcribed in the cells cultured in the presence of the substance with the level of the 
AID protein-encoding mRNA transcribed in the cells cultured in the absence or the 
substance. 

(24) A method for identifying a substance that regulates transcription of a 
gene encoding an AID protein comprising the amino acid sequence of SEQ ED NO:2 
or 8 into mRNA, or production of the AID protein, the method comprising the steps 
of: 

(a) culturing, separately in the presence and the absence of the 
substance, cells producing the AID protein and a protein other than the AID protein, 
wherein transcription of a gene encoding the other protein into mRNA is dependent in 
the cells on the degree of a signal of transcription of the gene encoding the ATT) 
protein into mRNA and 
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(b) comparing the level of the other protein produced by the cells 
cultured in the presence of the substance with the level of the other protein produced 
by the cells cultured in the absence of the substance. 

(25) The method of (23) or (24), wherein the cells are transgenic cells 
5 transformed with a gene encoding the protein. 

(26) The method of (24), wherein the cells are transgenic cells transformed 
with a gene encoding the protein and a gene encoding the other protein. 

(27) The method of (26), wherein the protein is a reporter protein. 

(28) The method of (27), wherein comparison of the level of the other 
10 protein is comparison of the level of a signal generated by the reporter protein. 

(29) The method of (27) or (28), wherein the reporter protein is luciferase. 

(30) A method for identifying a substance that inhibits an enzyme activity 
of an AID protein comprising the amino acid sequence of SEQ ID NO:2 or 8, the 
method comprising the step of (a) or (b) below: 

15 (a) culturing, separately in the presence and the absence of the 

substance, mammal-derived B cells or tissues comprising the B cells, and comparing 
enzyme activities of the AID protein in the B cells separately cultured or 

(b) (i) administering the substance separately to an AID gene 
knockout mouse whose endogenous AID gene is inactivated so that transcription of 
20 the endogenous AID gene into mRNA is inhibited, and to a normal mouse and 

(ii) comparing enzyme activities of the AID proteins in the B 
cells isolated from the respective mice. 

(31) The method of (30), wherein the enzyme activity is a cytidine 
deaminase activity 

25 . [0031] Hereafter, the present invention is explained in detail, by clarifying the terms used in 
the present invention and general methods for producing the proteins, DNAs, antibodies, and 
cells of the present invention. 

[0032] The "protein or a fragment thereof means a protein and a fragment thereof derived 
from a mammal such as human, bovine, sheep, pig, goat, rabbit, rat, hamster, guinea pig, 
30 mouse, and so on, preferably a protein or a fragment thereof derived from human, rabbit, rat, 
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or mouse, and particularly preferably, a protein ef-or a fragment thereof derived from human 
or mouse. 

[0033] As a particularly preferred embodiment, it means any protein or a fragment thereof 
below. 

(1) A protein or a fragment thereof comprising the amino acid sequence of 
SEQIDNO:2or8. 

(2) A protein or a fragment thereof comprising substantially the same 
amino acid sequence as that of SEQ ID NO:2 or 8 and having a cytidine deaminase 
activity. 

(3) A mammal-derived protein that comprises an amino acid sequence 
encoded by a DNA hybridizing under stringent conditions with a DNA comprising the 
nucleotide sequence of SEQ ID NO: 1, that is homologous to a protein comprising the 
amino acid sequence of SEQ ID NO:2, and that has a cytidine deaminase activity. 

(4) A mammal-derived protein that comprises an amino acid sequence 
encoded by a DNA hybridizing under stringent conditions with a DNA comprising the 
nucleotide sequence of SEQ ID NO:7, that is homologous to a protein comprising the 
amino acid sequence of SEQ ID NO: 8, and that has a cytidine deaminase activity. 

[0034] Here, "having substantially the same amino acid sequence" means that a protein has 
an amino acid sequence where multiple amino acids, preferably 1 to 10 amino acids, 
particularly preferably 1 to 5 amino acids, in the amino acid sequence shown in the 
references are substituted, deleted, and/or modified, and that a protein has an amino acid 
sequence where multiple amino acids, preferably 1 to 10 amino acids, particularly preferably 
1 to 5 amino acids, are added to the amino acid sequence shown in the references. 
[0035] The protein of the present invention includes monomer molecules, homodimers in 
which one strand binds to another strand comprising an identical amino acid sequence, 
heterodimers in which one strand binds to another strand comprising a different amino acid 
sequence, and oligomers such as trimers or tetramers. 

[0036] Also, the-a_"fragment of a protein" means an arbitrary partial sequence (fragment) in 
the amino acid sequence that the above-mentioned AID protein of the present invention 
comprises. For example, it includes an enzyme active site required for the AID protein to 
exert an enzyme activity represented by a cytidine deaminase activity, and an interaction site 
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required for the AH) protein to bind or associate with substrates (e.g. mRNA, etc.) or various 
auxiliary proteins. 

[0037] Alphabetical triplet or single letter codes used to represent amino acids in the present 
specification or figures mean amino acids as follows: 
5 [0038] (Gly/G), glycine; (Ala/A), alanine; (Val/V), valine; (Leu/L), leucine; (Ile/I), 

isoleucine; (Ser/S), serine; (Thr/T), threonine; (Asp/D), aspartic acid; (Glu/E), glutamic acid; 
(Asn/N), asparagines; (Gln/Q) glutamine; (Lys/K), lysine; (Arg/R), arginine; (Cys/C), 
cysteine; (Met/M), methionine; (Phe/F), phenylalanine; (Tyr/Y), tyrosine; (Trp/W), 
tryptophan; (His/H), histidine; (Pro/P), proline. 

10 [0039] The proteins and fragments of the present invention can be produced by properly 

using, in addition to genetic engineering technique mentioned below, methods well known in 
the art, such as chemical synthesis, cell culture method, and so on, or their modified methods. 
[0040] Also, the AID protein of the present invention can be produced as a recombinant 
fusion protein with another protein (e.g. GST (Glutathione S-transferase), etc.). In this case, 

15 the fusion protein is advantageous in that it can be extremely easily purified by affinity 
chromatography employing adsorbent on which another prot e in molecule binding 
specifically to GST is immobilized. Moreover, since various antibodies reactive to GST are 
provided, the quantification of the fusion protein can be simply carried out by immunoassay 
(e.g. ELISA, etc.) using-the antibodies against GST. 

20 [0041] The DNA of the present invention is a DNA encodings protein of the present 
invention and a fragment thereof, and it includes any nucleotide sequence encoding the 
protein of the present invention and includes both genomic DNAs and cDNAs. Also, the 
DNA includes any DNA composed of any codons as long as the codons encode identical 
amino acids. 

25 [0042] Also, the DNA of the present invention includes a DNA encoding a mammalian AID 
protein, and, as a preferred embodiment, a DNA encodings mouse AID protein or _a human 
AID protein can be exemplified. 

[0043] Examples of specific embodiments are as follows: 

(1) A DNA encoding a protein comprising the amino acid sequence of SEQ ID 
30 NO:2 or 8. 

(2) The DNA of ( 1 ) , wherein the protein has a cytidine deaminase activity. 
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(3) A DNA comprising the nucleotide sequences of SEQ ID NO: 1 or 7. 

(4) A DNA comprising the-nucleotides residue s 93 to 689 of SEQ ID NO : 1 . 

(5) A DNA comprising the-nucleotides rooiduos 80 to 676 of SEQ ID NO:7. 

(6) A DNA that hybridizes under stringent conditions with a DNA comprising the 
nucleotide sequence of SEQ ID NO:l and that encodes a mammal-derived protein 
being homologous to a protein that comprises the amino acid sequence of SEQ ID 
NO:2 and having a cytidine deaminase activity. 

(7) A DNA that hybridizes under stringent conditions with a DNA comprising the 
nucleotide sequence of SEQ ID NO:7 and that encodes a mammal-derived protein 
being homologous to a protein that comprises the amino acid sequence of SEQ ID 
NO: 8 and having a cytidine deaminase activity. 

(8) A genomic DNA or a fragment thereof comprising a nucleotide sequence of 
any one of (a) to (c) below: 

(a) SEQ ID NO:9, 

(b) SEQ ID NO: 10, or 

(c) SEQ ID NO:35. 

(9) A genomic DNA or a fragment thereof comprising a nucleotide sequence of 
any one of (a) to (e) below: 

(a) SEQ ID NO: 11, 

(b) SEQ ID NO: 12, 

(c) SEQ ID NO: 13, 

(d) SEQ ID NO: 14, or 

(e) SEQ ID NO: 15. 

(10) A DNA comprising a complementary nucleotide sequence to an arbitrary 
partial sequence of a nucleotide sequence of any one of (a) to (h) below: 

(a) SEQ ID NO:9, 

(b) SEQ ID NO: 10, 

(c) SEQ ID NO: 11, 

(d) SEQ ID NO: 12, 

(e) SEQ ID NO: 13, 

(f) SEQ ID NO: 14, 
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(g) SEQIDNO:15,or 

(h) SEQIDNO:35. 

(11) A DNA comprising a nucleotide sequence of any one of (a) to (q) below: 
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(1) 


SEQIDNO:29, 




(m) 


SEQ ID NO:30, 




(n) 


SEQ IDN0:31, 




(o) 


SEQ ID NO:32, 




(P) 


SEQ ID NO:33, or, 


20 


(q) 


SEQIDNO:34. 



[0044] Furthermore, a DNA encoding a mutant protein or a fragment thereof obtained by 
substituting, deleting, and/or modifying multiple amino acids, preferably 1 to 10 amino acids, 
particularly preferably 1 to 5 amino acids, or by inserting multiple amino acids, preferably 
1 to 10 amino acids, particularly preferably 1 to 5 amino acids in the amino acid sequence 
25 constituting the above-defined AID protein of the present invention or a fragment thereof is 
included in the DNA of the present invention. 

[0045] The term "under stringent conditions" used herein means, for example, the following 
conditions. For example, in the case of carrying out hybridization using a probe with not less 
than 50 bases in 0.9% NaCl, target temperature of causing 50% dissociation (Tm) can be 
30 calculated from the formula below, and the hybridization temperature can be set as the 
formula below. 
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Tm = 82.3°C+0.41 '(G+C)%-500/n-0.61x (formamide)% 
(n means the number of bases of the probe) 

Temperature = Tm-25°C 
[0046] Also, in the case of using a probe with not less than 100 bases (G+C = 40 to 50%), 
the changes of Tm as (1) and (2) below can be used as the indicator. 

(1) Every 1% mismatch decreases Tm by approximately 1°C. 

(2) Every 1% formamide decreases Tm by 0.6 to 0.7°C. 
[0047] Thus, the temperature condition in the case of combination of complete 
complementary strands can be set as below. 

(A) 65 to 75°C (without formamide) 

(B) 35 to 45°C (with 50% formamide) 

[0048] The temperature condition in the case of combination of incomplete complementary 
strands can be set as below. 

(A) 45 to 55°C (without formamide) 

(B) 35 to 42°C (with 30% formamide) 

[0049] In the case of using probes with not more than 23 bases, temperature can be 37°C, or 
the formula below can also be used as an indicator. 

Temperature = 2°C x (number of A+T)+4°C x (number of C+G)-5°C 
[0050] The DNA of the present invention can be a DNA obtained by any method. For 
example, the DNA includes complementary DNA (cDNA) prepared from mRNA, DNA 
prepared from genomic DNA, DNA prepared by chemical synthesis, DNA obtained by PCR 
amplification with RNA or DNA as a template, and DNA constructed by appropriately 
combining these methods. 

[0051] As used herein, an "isolated nucleic acid" is a nucleic acid, the structure of which is 
not identical to that of any naturally occurring nucleic acid or to that of any fragment of a 
naturally occurring genomic nucleic acid spanning more than three genes. The term 
therefore covers, for example, (a) a DNA which has the sequence of part of a naturally 
occurring genomic DNA molecule but is not flanked by both of the coding sequences that 
flank that part of the molecule in the genome of the organism in which it naturally occurs; (b) 
a nucleic acid incorporated into a vector or into the genomic DNA of a prokaryote or 
eukaryote in a manner such that the resulting molecule is not identical to any naturally 
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occurring vector or genomic DNA; (c) a separate molecule such as a cDNA, a genomic 
fragment, a fragment produced by polymerase chain reaction (PCR), or a restriction 
fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene 
encoding a fusion protein. Specifically excluded from this definition are nucleic acids 
5 present in random, uncharacterized mixtures of different DNA molecules, transfected cells, 
or cell clones, e.g., as these occur in a DNA library such as a cDNA or genomic DNA library. 
[0052] The term "substantially pure" as used herein in reference to a given polypeptide 
means that the polypeptide is substantially free from other biological macromolecules. For 
example, the substantially pure polypeptide is at least 75%, 80, 85, 95, or 99% pure by dry 

10 weight. Purity can be measured by any appropriate standard method known in the art, for 
example, by column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. 
[0053] The invention also includes a polypeptide, or fragment thereof, that differs from the 
corresponding sequence shown as SEQ ID NO:2 or 8. The differences are, preferably, 
differences or changes at a non-essential residue or a conservative substitution. In one 

15 embodiment, the polypeptide includes an amino acid sequence at least about 60% identical to 
a sequence shown as SEQ ID NO:2 or 8, or a fragment thereof. Preferably, the polypeptide is 
at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more identical to SEQ ID 
NO:2 or 8 and has at least one cytidine deaminase function or activity described herein. 
Preferred polypeptide fragments of the invention are at least 10%, preferably at least 20%, 

20 30%, 40%, 50%, 60%, 70% o , or more, of the length of the sequence shown as SEQ ID NO;2 
or 8 and have at least one cytidine deaminase activity described herein. Or alternatively, the 
fragment can be merely an immunogenic fragment. 

[0054] As used herein, "% identity" of two amino acid sequences, or of two nucleic acid 
sequences, is determined using the algorithm of Karlin and Altschul (PNAS USA 87:2264- 

25 2268, 1990), modified as in Karlin and Altschul, PNAS USA 90:5873-5877, 1993). Such an 
algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al. (J. 
Mol. Biol. 215:403-410, 1990). BLAST nucleotide searches are performed with the 
NBLAST program, score = 100, wordlength =12. BLAST protein searches are performed 
with the XBLAST program, score =50, wordlength=3. To obtain gapped alignment for 

30 comparison purposes GappedBLAST is utilized as described in Altschul et al (Nucleic Acids 
Res. 25:3389-3402, 1997). When utilizing BLAST and GappedBLAST programs the default 



parameters of the respective programs (e.g., XBLAST and NBLAST) are used to obtain 
nucleotide sequences homologous to a nucleic acid molecule of the invention. 
[0055] Accordingly, in one aspect, the invention provides an isolated or purified nucleic acid 
molecule that encodes a polypeptide described herein or a fragment thereof. Preferably, the 
5 isolated nucleic acid molecule includes a nucleotide sequence that is at least 60% identical to 
the nucleotide sequence shown in SEQ ID NO:l or 7. More preferably, the isolated nucleic 
acid molecule is at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%>, 97%, 98%>, 99%), or more, identical to the nucleotide sequence shown in SEQ ED NO:l 
or 7. In the case of an isolated nucleic acid molecule which is longer than or equivalent in 
10 length to the reference sequence, e.g., SEQ ID NO: 1 or 7, the comparison is made with the 
full length of the reference sequence. Where the isolated nucleic acid molecule is shorter that 
the reference sequence, e.g., shorter than SEQ ID NO:l or 7, the comparison is made to a 
segment of the reference sequence of the same length (excluding any loop required by the 
homology calculation). 

15 [0056] The DNA encoding the protein of the present invention can be prepared by the usual 
methods: cloning cDNA from mRNA encoding the protein of the present invention, isolating 
genomic DNA and splicing it, chemical synthesis, and so on. 

[0057] (1) cDNA can be cloned from mRNA encoding the protein of the present invention 

20 by, for example, the method described below. 

[0058] First, the mRNA encoding the protein of the present invention is prepared from the 
above-mentioned tissues or cells expressing and producing the protein of the present 
invention. mRNA can be prepared by isolating total RNA by a known method such as 
guanidine-thiocyanate method (Chirgwin et al., Biochemistry, 18:5294, 1979), hot phenol 

25 method, or AGPC method, and subjecting it to affinity chromatography using oligo-dT 
cellulose or poly-U Sepharose. 

[0059] Then, with the mRNA obtained as a template, cDNA is synthesized, for example, by a 
well-known method using reverse transcriptase, such as the method of Okayama et al (Mol. 
Cell. Biol. 2:161 (1982); Mol. Cell. Biol. 3:280 (1983)) or the method of Hoffinan et al. 
30 (Gene 25:263 (1983)), and converted into double-stranded cDNA. A cDNA library is 
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prepared by transforming E. coli with plasmid vectors, phage vectors, or cosmid vectors 
having this cDNA or by transfecting E. coli after in vitro packaging. 
[0060] The plasmid vectors used in this invention are not limited as long as they are 
replicated and maintained in hosts. Any phage vector that can be replicated in hosts can also 
5 be used. Examples of usually used cloning vectors are pUC19, XgtlO, X-gtl 1, and so on. 
When the vector is applied to immunological screening as mentioned below, a vector having 
a promoter that can express a gene encoding the desired protein in a host is preferably used. 
[0061] cDNA can be inserted into a plasmid by, for example, the method of Maniatis et al. 
(Molecular Cloning, A Laboratory Manual, second edition, Cold Spring Harbor Laboratory, 

10 p. 1.53, 1989). cDNA can be inserted into a phage vector by, for example, the method of 
Hyunh et al (DNA cloning, a practical approach, 1, p.49 (1985)). These methods can be 
simply performed by using a commercially available cloning kit (for example, a product from 
Takara Shuzo). The recombinant plasmid or phage vector thus obtained is introduced into an 
appropriate host cell such as a prokaryote (for example, E. coli: HB101, DH5a, MC1061/P3, 

15 etc). 

[0062] Examples of a method for introducing a plasmid into a host are, calcium chloride 
method, calcium chloride/rubidium chloride method and electroporation method, described 
in Molecular Cloning, A Laboratory Manual (second edition, Cold Spring Harbor Laboratory, 
p. 1.74 (1989)). Phage vectors can be introduced into host cells by, for example, a method in 
20 which the phage DNAs are introduced into grown hosts after in vitro packaging. In vitro 
packaging can be easily performed with a commercially available in vitro packaging kit (for 
example, a product from Stratagene or Amersham). 

[0063] The identification of cDNA encoding protein, its expression being augmented 
depending on the stimulation of cytokines like AID protein of the present invention, can be 

25 carried out by for example suppression subtractive hybridization (SSH)(Proc. Natl. Acad. 
Sci. USA, 93:6025-6030, 1996; Anal. Biochem., 240:90-97, 1996) taking advantage of 
suppressive PCR effect (Nucleic Acids Res., 23:1087-1088, 1995), using two cDNA libraries, 
namely, a^cDNA library constructed from mRNA derived from stimulated cells (tester cDNA 
library) and tha ^one constructed from mRNA derived from unstimulated cells (driver cDNA 

30 library). 
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[0064] The preparation of cDNA libraries required for subtraction cloning can be performed 
by using_a commercially available kit, for example, PCR-Select Subtraction Kit 
(CLONTECH, cat: Kl 804-1). The experiment can be performed according to the docum e nt 
of proc e dur e instructions accompanying in-the kit. 
5 [0065] An example of a_practical experimental procedure is listed below, briefly. 

[0066] PolyA + RNA is prepared from cells with or without stimulation with appropriate 
stimulant as previously reported m e thod (Nucleic Acids Res., 26:911-918, 1998). Next, 
cDNA is prepared^ using reverse transcriptasei from each polyA + RNA samples, as is the 
commonly used method. cDNA prepared from stimulated cells is used as tester cDNA and 

10 that prepared from unstimulated cells as driver cDNA. 

[0067] According to the previous report mentioned above and experimental manuals 
accompanying with-the kit, driver cDNA is added to tester cDNA to perform subtraction. 
The efficiency of subtraction is monitored by adding small amount of exogenous DNA as a 
control. After subtraction, the exogenous DNA is concentrated. 

15 [0068] The subtracted cDNA is cloned into an appropriate plasmid expression vector to 
construct a plasmid library b^ -by a commonly used method. 

[0069] Similar to the previously report ed method , many colonies are screened by differential 
hybridization method (Nucleic Acids Res., 26:911-918, 1998; RINSYO-MEN-EKI, 29:451- 
459, 1997). Here, as the hybridization probes, tester cDNA and driver cDNA mentioned 
20 above labeled with radioisotope can be used. Clones containing the objective DNA and that 
or containing exogenous DNA can be distinguished by hybridizing the exogenous DNA with 
replicant filters. 

[0070] Objective cDNA or its fragment can be obtained by selecting clones giving strong 
signals against radiolabeled tester cDNA probe rather than radiolabeled driver cDNA probe. 
25 [0071] Also, cDNA encoding the protein of the present invention can be accomplished by 
other general cDNA screening methods. 

[0072] For instance, cDNA or its-a_fragment encoding the protein of the present invention 
cloned by subtraction cloning method mentioned above, or chemically synthesized 
oligonucleotides corresponding to an amino acid sequence of the protein of the present 
30 invention, are labeled with 32 P to make probes, then by well-known colony hybridization 

methods (Crunstein et al., Proc. Natl. Acid. Sci. USA, 72:3961, 1975) or plaque hybridization 
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methods (Molecular Cloning, A Laboratory Manual, second edition, Cold Spring Harbor 
Laboratory, p.2.108, 1989), commercial or originally prepared cDNA libraries can be 
screened. Furthermore, a method to amplify DNA including cDNA encoding the protein of 
the present invention by PCR, by constructing a pair of PCR primers based on cDNA or its 
5 fragment encoding the protein of the present invention isolated by the subtraction cloning 
mentioned above, can be listed. 

[0073] When a cDNA library prepared using a cDNA expression vector is used, the desired 
clone can be screened by the antigen-antibody reaction using an antibody against the desired 
protein. A screening method using PCR m e thod m ethodology is preferably used when many 

10 clones are subjected to screening. 

[0074] The nucleotide sequence of the DNA thus obtained can be determined by the Maxam- 
Gilbert method (Maxam et al., Proc. Natl. Acad. Sci. USA, 74:560 (1977)) or the 
dideoxynucleotide synthetic chain termination method using phage Ml 3 (Sanger et aL A Proc. 
Natl. Acad. Sci. USA, 74:5463-5467 (1977)). The nucleotide sequence can be easily 

1 5 determined using a commercial DNA sequencer. 

[0075] The whole or a part of the gene encoding the protein of the present invention can be 
obtained by excising the clone obtained as mentioned above with restriction enzymes and so 
on. 

20 [0076] (2) Also, the DNA encoding the protein of the present invention can be isolated 

from the-genomic DNA derived from the cells expressing the protein of the present invention 
as mentioned above by the following methods. 

[0077] Such cells are solubilized preferably by SDS or proteinase K, and the DNAs are 
deproteinized by repeating phenol extraction. RNAs are digested preferably with 

25 ribonuclease. The DNAs obtained are partially digested with appropriate restriction 

enzymes, and the DNA fragments obtained are amplified with appropriate phage or cosmid to 
generate a library. Then, clones having the desired sequence are detected, for example, by 
using radioactively labeled DNA probes, and the whole or a portion of the gene encoding the 
protein of the present invention is obtained from the clones by excision with restriction 

30 enzymes, etc. 
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[0078] For example, cDNA encoding a human-derived protein can be obtained by preparing 
a cosmid library into which human genomic DNAs (chromosomal DNAs) are introduced 
("Laboratory Manual Human Genome Mapping," M. Hori and Y. Nakamura, eds., 
Maruzen), screening the cosmid library to obtain positive clones containing DNA 
5 corresponding to the coding region of the desired protein, and screening the above cDNA 
library using the coding region DNA excised from the positive clones as a probe. 
[0079] Also, the present invention relates to any fragment of DNA (cDNA, genomic DNA, 
etc.) encoding an AID protein (especially_a human AID protein) of the present invention 
described above. DNA with ^compl e m e ntary nucleotide sequence complementary to any 
1 0 nucleotide sequence of cDNA or genomic DNA is useful as a primer DNA in polymerase 
chain reaction (PCR). By PCR using a pair of the-primer DNAs, any partial nucleotide 
sequence of genomic DNA encoding AID protein (especially human AID protein) of the 
present invention can be amplified. 

[0080] For instance, in the case that mutation or deletion of genomic DNA (especially exon) 
15 encoding the AID protein is presumed to cause a certain immunodeficiency or allergy, the 
existence of such tbe-amutation or deletion can be analyzed by PCR described below. 

( 1 ) Prepare a pair of primers comprising compl e m e ntary nucleotide 
sequence complementary to any partial nucleotide sequence of genomic DNA 
encoding an AID protein of the present invention. 
20 (2) Amplify the objective partial nucleotide sequence of the genomic 

DNA using the pair of primers, using genomic DNA encoding AID protein obtained 
from tissue or cells of immunodeficiency or allergy patients as templates. 

(3) Analyze the existence of PCR products and the nucleotide sequence of 
the PCR products, and identify the mutation and deletion in the genomic DNA by 
25 comparing the nucleotide sequence and corresponding nucleotide sequence of 

genomic DNA encoding AID protein derived from anormal human. 
[0081] Thus, the method described above can not only elucidate, for example, the 
relation ship between immunodeficiency and/or allergy and AID protein, but also be used for 
the diagnosis of a certain kind of disease, in the case that AID protein is the cause of the 
30 disease. 

[0082] Examples of the nucleotide sequence of the primer DNA are as follows: 
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(1) A DNA comprising a complementary nucleotide sequence to an arbitrary 
partial sequence of a nucleotide sequence of any one of (a) to (h) below: 





(a) 


isisQ id NU:y, 




(o) 


de,Q ID JNU:1U, 


5 


(c) 


SEQIDNO:ll, 




(d) 


SEQIDNO:12, 




(e) 


SEQIDNO:13, 




(f) 


SEQIDNO:14, 




(g) 


SEQIDNO:15 s or 


10 


(h) 


SEQ ID NO:35. 



(2) A DNA comprising a nucleotide sequence of any one of (a) to (q) below: 



(a) 


SEQ ID NO: 18, 


(b)' 


SEQ ID NO: 19, 


(c) 


SEQ ID NO:20, 


(d) 


SEQIDNO:21, 


(e) 


SEQ ID NO:22, 


(f) 


SEQIDNO:23, 


(g) 


SEQ ID NO:24, 


(h) 


SEQIDNO:25, 


(i) 


SEQ ID NO:26, 


0) 


SEQ ID NO:27, 


(k) 


SEQIDNO:28, 


(1) 


SEQ ID NO:29, 


(m) 


SEQ ID NO:30, 


(n) 


SEQIDNO:31, 


(o) 


SEQ ID NO:32, 


(P) 


SEQ ID NO:33, or, 


(q) 


SEQ ID NO:34. 



[0083] Also, the present invention relates to the use of the above-mentioned DNA fragment 
30 as a primer DNA in polymerase chain reaction. 
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[0084] Examples of the combination of primer DNAs for PCR in diagnosis accomplished by 
PCR gene amplification and by analyzing it are as follows: 

(1) a DNA comprising the nucleotide sequence of SEQ ID NO: 3 1 and a 
DNA comprising the nucleotide sequence of SEQ ED NO:32, 

(2) a DNA comprising the nucleotide sequence of SEQ ID NO:20 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:22, 

(3) a DNA comprising the nucleotide sequence of SEQ ID NO:21 and a 
DNA comprising the nucleotide sequence of SEQ ID NO: 30, 

(4) a DNA comprising the nucleotide sequence of SEQ ID NO: 24 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:25, 

(5) a DNA comprising the nucleotide sequence of SEQ ID NO:23 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:27, 

(6) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:28, 

(7) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:29, 

(8) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:27, 

(9) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:28, 

(10) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:29, 

(1 1) a DNA comprising the nucleotide sequence of SEQ ID NO:34 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:28, 

(12) a DNA comprising the nucleotide sequence of SEQ ID NO:34 and a 
DNA comprising the nucleotide sequence of SEQ ID NO: 29, 

(13) a DNA comprising the nucleotide sequence of SEQ ID NO:33 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:29, or, 

(14) a DNA comprising the nucleotide sequence of SEQ ID NO: 1 8 and a 
DNA comprising the nucleotide sequence of SEQ ID NO: 19. 
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[0085] Moreover, the present invention also relates to a recombinant vector comprising the 
DNA encoding the protein of the present invention. As a recombinant vector of the present 
invention, any vector can be used as long as it is capable of retaining replication or self- 
multiplication in each host cell of prokaryotic and/or eukaryotic cells, including plasmid 
5 vectors and phage vectors. 

[0086] The recombinant vector can easily be prepared by ligating the DNA encodings 
protein of the present invention with a vector for recombination available in the art (plasmid 
DNA and bacteriophage DNA) by the usual method. 

[0087] Specific examples of the vectors used for recombination used-are E. co/z-derived 
10 plasmids such as pBR322, pBR325, pUC12, pUC13, and pUC19, yeast-derived plasmids 
such as pSH19 and pSH15, and Bacillus subtilis-derived plasmids such as pUBHO, pTP5, 
and pC194. Examples of phages are a bacteriophage such as A, phage, and an animal or 
insect virus (pVL1393, Invitrogen) such as a retrovirus, vaccinia virus, and nuclear 
polyhedrosis virus. 

15 [0088] An expression vector is useful for expressing the DNA encoding the protein of the 
present invention and for producing the protein of the present invention. The expression 
vector is not limited as long as it expresses the gene encoding the protein of the present 
invention in various prokaryotic and/or eukaryotic host cells and produces this protein. 
Examples thereof are pMAL C2, pEF-BOS (Nucleic Acids Res. 18:5322 (1990) and so on), 

20 pME18S (Experimental Medicine: SUPPLEMENT, "Handbook of Genetic Engineering" 
(1992) and so on), etc. 

[0089] Also, the protein of the present invention can be produced as a fusion protein with 
other proteins. It can be prepared as a fusion protein, for example, with GST (Glutathione 
S-transferase) by subcloning a cDNA encoding the protein of the present invention, for 
25 example, into plasmid pGEX4Tl (Pharmacia), by transforming E. coli DH5a, and by 
culturing the transformant. 

[0090] When bacteria, particularly E. coli^ are used as host cells, an expression vector 
generally comprises, at least, a promoter/operator region, an initiation codon, the DNA 
encoding the protein of the present invention, termination codon, terminator region, and 
30 replicon. 
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[0091] When yeast, animal cells, or insect cells are used as hosts, an expression vector is 
preferably comprising, at least, a promoter, an initiation codon, the DNA encoding the 
protein of the present invention, and a termination codon. It may also comprise the DNA 
encoding a signal peptide, enhancer sequence, 5'- and 3 '-untranslated region of the gene 
5 encoding the protein of the present invention, splicing junctions, polyadenylation site, 
selectable marker region, and ajeplicon. The expression vector may also contain, if 
required, a gene for gene amplification (marker) that is usually used. 
[0092] A promoter/operator region to express the protein of the present invention in bacteria 
comprises a promoter, an operator, and a Shine-Dalgarno (SD) sequence (for example, 

10 AAGG). For example, when the host is Escherichia, it preferably comprises Trp promoter, 
lac promoter, rec A promoter, XPL promoter, Ipp promoter, tac promoter, or the like. 
Examples of a promoter to express the protein of the present invention in yeast are PH05 
promoter, PGK promoter, GAP promoter, ADH promoter, and so on. When the host is 
Bacillus, examples thereof are SL01 promoter, SP02 promoter, penP promoter, and so on. 

15 When the host is a eukaryotic cell such as a mammalian cell, examples thereof are 

S V40-derived promoter, retrovirus promoter, heat shock promoter, and so on, and preferably 
an_SV-40 and-or retro virus-derived one. As a matter of course, the promoter is not limited to 
the above examples. In addition, using an enhancer is effective for expression. 
[0093] A preferable initiation codon is, for example, a methionine codon (ATG). 

20 [0094] A commonly used termination codon (for example, TAQ TAA, TGA) is exemplified 
as a termination codon. 

[0095] Usually, used-natural or synthetic terminators are used as a terminator region. 
[0096] Areplicon means a DNA capable of replicating the whole DNA sequence in host 
cells, and includes a natural plasmid, an artificially modified plasmid (DNA fragment 

25 prepared from a natural plasmid), a synthetic plasmid, and so on. Examples of preferable 
plasmids are pBR322 or its artificial derivatives (DNA fragment obtained by treating 
pBR322 with appropriate restriction enzymes) for E. coli, yeast 2 ju, plasmid or yeast 
chromosomal DNA for yeast, and pRSVneo ATCC 37198, pSV2dhfr ATCC 37145, pdBPV- 
MMTneo ATCC 37224, pSV2neo ATCC 37149, and such for mammalian cells. 

30 [0097] An enhancer sequence, polyadenylation site, and splicing junction that are usually 
used in the art, such as those derived from SV40.! can also be used. 
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[0098] A selectable marker usually employed can be used according to the usual method. 
Examples thereof are resistance genes for antibiotics, such as tetracycline, ampicillin, or 
kanamycin. 

[0099] Examples of genes for gene amplification are dihydrofolate reductase (DHFR) gene, 
5 thymidine kinase gene, neomycin resistance gene, glutamate synthase gene, adenosine 
deaminase gene, ornithine decarboxylase gene, hygromycin-B- 
phophotransf e ras e phosphotransferase gene, aspartate transcarbamylase gene, etc. 
[00100] The expression vector of the present invention can be prepared by 
continuously and circularly linking at least the above-mentioned promoter, initiation codon, 
10 DNA encoding the protein of the present invention, termination codon, and terminator 
region, to an appropriate replicon. If desired, appropriate DNA fragments (for example, 
linkers, restriction sites, and so on), can be used by the usual method such as digestion with a 
restriction enzyme or ligation using T4 DNA ligase. 

[00101] Transformants of the present invention can be prepared by introducing the 

15 expression vector mentioned above into host cells. 

[00102] Host cells used in the present invention are not limited as long as they are 
compatible with an expression vector mentioned above and can be transformed. Examples 
thereof are various cells such as wild-type cells or artificially established recombinant cells 
usually used in the technical field of the present invention (for example, bacteria 

20 {Escherichia and Bacillus), yeast (Saccharomyces, Pichia, and such), animal cells, or insect 
cells). 

[00103] E. coli or animal cells are preferably used. Specific examples are E. coli 
(DH5a, TBI, HB101, and such), mouse-derived cells (COP, L, C127, Sp2/0, NS-1, NIH 3T3, 
and such), rat-derived cells (PC12, PC12h), hamster-derived cells (BHK, CHO, and such), 
25 monkey-derived cells (COS1, COS3, COS7, CV1, Velo, and such), and human-derived cells 
(Hela, diploid fibroblast-derived cells, myeloma cells, and HepG2, and such). 
[00104] An expression vector can be introduced (transformed (transfected)) into host 
cells by known methods. 

[00105] Transformation can be performed, for example, according to the method of 
30 Cohen et al. (Proc. Natl. Acad. Sci. USA, 69:2110 (1972)),_the protoplast method (Mol. Gen. 
Genet., 168:111 (1979)), or the competent method (J. Mol. Biol., 56:209 (1971)) when the 



hosts are bacteria (E. coli, Bacillus subtilis, and such), the method of Hinnen et al. (Proc. 
Natl. Acad. Sci. USA, 75:1927 (1978)), or the lithium method (J. Bacteriol., 153:163 (1983)) 
when the host is Saccharomyces cerevisiae, the method of Graham (Virology, 52:456 (1973)) 
when the hosts are animal cells, and the method of Summers et al. (Mol. Cell. Biol, 3:2156- 
5 2165 (1983)) when the hosts are insect cells. 

[00106] The protein of the present invention can be produced by cultivating 
transformants (in the following, this term includes transfectants) comprising an expression 
vector prepared as mentioned above in nutrient media. 

[00107] The nutrient media preferably comprises a carbon source,_an inorganic 
10 nitrogen source, or- an organic nitrogen source necessary for the growth of host cells 

(transformants). Examples of the carbon source are glucose, dextran, soluble starch, and 
sucrose, and examples of the inorganic or organic nitrogen source are ammonium salts, 
nitrates, amino acids, corn steep liquor, peptone, casein, meet- meat extract, soy bean cake, 
and potato extract. If desired, they may comprise other nutrients (for example, an inorganic 
15 salt (for example, calcium chloride, sodium dihydrogenphosphate, and magnesium chloride), 
vitamins, antibiotics (for example, tetracycline, neomycin, ampicillin, kanamycin, and so on). 
. [00108] Cultivation is performed by a method known in the art. Cultivation conditions 
such as temperature, pH of the media, and cultivation time are selected appropriately so that 
the protein of the present invention is produced in large quantities. 
20 [00109] Specific media and cultivation conditions used depending on host cells are 

illustrated below, but are not limited thereto. 

[00110] When the hosts are bacteria, actinomycetes, yeasts, or filamentous fungi, 
liquid media comprising the nutrient source mentioned above are appropriate. Th e m Media 
with apH of 5 to 8 are preferably used. 
25 [00111] When the host is E. coli, examples of preferable media are LB media, M9 

media (Miller et al. Exp. Mol. Genet., Cold Spring Harbor Laboratory, p.431 (1972)), and so 
on. Using these media, cultivation can be performed usually at 14 to 43 °C for about 3 to 
24 hours with aeration and stirring, if necessary. 

[00112] When the host is Bacillus, cultivation can be performed usually at 30 to 40°C 
30 for about 16 to 96 hours with aeration and stirring, if necessary. 
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[00113] When the host is yeast, an example of medium m edians Burkholder minimal 
medium m edta-(Bostian, Proc. Natl. Acad. Sci. USA, 77:4505 (1980)). The pH of the 
medium m edia-is preferably 5 to 8. Cultivation can be performed usually at 20 to 35°C for 
about 14 to 144 hours with aeration and stirring, if necessary. 

5 [00114] When the host is an animal cell, examples of media are MEM m e dia 
containing about 5 to 20% fetal bovine serum (Science, 122:501 (1952)), DMEM m e dia 
(Virology, 8:396 (1959)), RPMI1640 medium m edia-d. Am. Med. Assoc., 199:519 (1967)), 
199 medium m edia-(Proc. Soc. Exp. Biol. Med., 73:1 (1950)), and so on. The pH of the 
medium medians preferably about 6 to 8. Cultivation can be performed usually at about 30 

10 to 40°C for about 15 to 72 hours with aeration and stirring, if necessary. 

[00115] When the host is an insect cell, an example of media-medium is Grace's m e dia 
medium containing fetal bovine serum (Proc. Natl. Acad. Sci. USA, 82:8404 (1985)). The 
pH thereof is preferably about 5 to 8. Cultivation can be performed usually at about 20 to 
40°C for 15 to 100 hours with aeration and stirring, if necessary. 

15 [00116] The protein of the present invention can be produced by cultivating 

transformants, especially mammalian cells, as mentioned above and allowing them to secrete 
the protein into the culture supernatant. 

[00117] A culture filtrate (supernatant) is obtained by a method such as filtration or 
centrifugation of the obtained culture, and the protein of the present invention is purified and 
20 isolated from the culture filtrate by methods commonly used in order to purify and isolate a 
natural or synthetic protein. 

[00118] Examples of the isolation and purification method are a method utilizing 
solubility, such as salting out and solvent precipitation method; a method utilizing the 
difference in molecular weight, such as dialysis, ultrafiltration, gel filtration, and sodium 

25 dodecyl sulfate-polyacrylamide gel electrophoresis; a method utilizing charges, such as ion 
exchange chromatography and hydroxylapatite chromatography; a method utilizing specific 
affinity, such as affinity column chromatography; a method utilizing the difference in 
hydrophobicity, such as reverse phase high performance liquid chromatography; and a 
method utilizing the difference in isoelectric point, such as isoelectric focusing. 

30 [00119] When the protein of the present invention exists in the periplasm or cytoplasm 

of cultured transformants (for example, E. coli\ first, the cells are harvested by a usual 



method such as filtration or centrifugation and suspended in appropriate buffer. After the cell 
wall and/or cell membrane of the cells and such are disrupted by the-amethod such as lysis 
with sonication, lysozyme, and freeze-thawing, the membrane fraction comprising the 
protein of the present invention is obtained by the-amethod such as centrifugation or 
5 filtration. The membrane fraction is solubilized with a detergent such as Triton-XlOO to 

obtain the crude extract. Finally, the protein is isolated and purified from the crude extract by 
the-ausual method as illustrated above. 

[00120] By using a DNA (cDNA or genomic DNA) encoding a human-derived AID 
protein included in the protein of the present invention, transgenic non-human mammals 

10 secreting the human AID protein in their body can be prepared. Namely, by integrating the 
human-derived DNA into an endogenous locus of non-human mammals (e.g. mouse), the 
human AID protein of the present invention encoded by the DNA is expressed and secreted 
in their body. The transgenic non-human mammals are included in the present invention. 
[00121] The transgenic non-human mammals can be prepared according to the method 

15 usually used for producing a transgenic animal (for example, see "Newest Manual of Animal 
Cell Experiments LIC press, Chapter 7, pp.361-408, (1990)). 

[00122] Specifically, for example, a transgenic mouse can be produced as follows. 
Embryonic stem cells (ES cells) obtained from normal mouse blastocysts are transformed 
with an expression vector in which the gene encoding the human AID protein of the present 

20 invention and a marker gene (for example, neomycin resistance gene) have been inserted in 
an expressible manner. ES cells in which the gene encoding the human AID protein of the 
present invention has been integrated into the endogenous gene are screened by a usual 
method based on expression of the marker gene. Then, the ES cells screened are 
microinjected into a fertilized egg (blastocyst) obtained from another normal mouse (Proc. 

25 Natl. Acad. Sci. USA, 77:7380-7384 (1980); U.S. Pat. No. 4,873,191). 

[00123] The blastocyst is transplanted into the uterus of another normal mouse as the 
foster mother. Then, founder mice are born from the foster mother. By mating the founder 
mice with normal mice, heterozygous transgenic mice are obtained. By mating the 
heterozygous transgenic mice with each other, homozygous transgenic mice are obtained 

30 according to Mendel's laws. 
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[00124] Also, a_so-called "knockout mouse" can be generated based on the nucleotide 
sequence of DNA encoding mouse AID protein included in the present invention. The 
"knockout mouse" in the present invention means the mouse in which the endogenous gene 
encoding the mouse AID protein of the present invention is knocked-out (inactivated). For 
5 example, it can be generated by positive-negative selection method applying homologous 
recombination (U.S. Pat. Nos. 5,464,764; 5,487,992; 5,627,059; Proc. Natl. Acad. Sci. USA, 
86:8932-8935, 1989; Nature, 342:435-438, 1989; etc.), and such knockout mice are one 
embodiment of the present invention. 

[00125] The "antibody" in the present invention means a polyclonal antibody 
10 (antiserum) or a monoclonal antibody, and preferably a monoclonal antibody. 

[00126] Specifically, it includes an antibody reactive to the above-mentioned protein 
of the present invention and a fragment thereof. 

[00127] The "antibody" of the present invention also includes a natural antibody that 
can be prepared by immunizing mammals such as mice, rats, hamsters, guinea pigs, or 

15 rabbits with the protein of the present invention (including natural, recombinant, and 
chemically synthesized protein and cell), a fragment thereof, or a transformant highly 
expressing the protein of interest by recombinant technology mentioned above; a chimeric 
antibody and a humanized antibody (CDR-grafted antibody) that can be produced by 
recombinant technology; and a human monoclonal antibody that can be produced by using 

20 human antibody-producing transgenic animals. 

[00128] The monoclonal antibody includes those having any one of the isotypes of 
IgG, IgM, IgA, IgD, or IgE. IgG or IgM is preferable. 

[00129] The polyclonal antibody (antiserum) or monoclonal antibody of the present 
invention can be produced by known methods. Namely, mammals, preferably, mice, rats, 

25 hamsters, guinea pigs, rabbits, cats, dogs, pigs, goats, horses, or bevinecows, or more 

preferably, mice, rats, hamsters, guinea pigs, or rabbits are immunized, for example, with an 
antigen mentioned above with Freund's adjuvant, if necessary. The polyclonal antibody can 
be obtained from the serum obtained from the animal so immunized. The monoclonal 
antibodies are produced as follows. Hybridomas are produced by fusing the antibody- 

30 producing cells obtained from the animal so immunized and myeloma cells incapable of 
producing autoantibodies. Then the hybridomas are cloned, and clones producing the 
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monoclonal antibodies showing the specific affinity to the antigen used for immunizing the 
mammal are screened. 

[00130] Specifically, the monoclonal antibody can be produced as follows. 
Immunizations are done by injecting or implanting once or several times the above- 
5 mentioned protein of the present invention, a fragment thereof, the cells that express the 
protein, and so on, as an immunogen, if necessary, with Freund's adjuvant, subcutaneously, 
intramuscularly, intravenously, through the footpad, or intraperitoneally into mice, rats, 
hamsters, guinea pigs, or rabbits, preferably mice, rats or hamsters (including transgenic 
animals generated so as to produce antibodies derived from another animal such as the-a 
10 transgenic mouse producing human antibod ies? ). Usually, immunizations are performed 
eeee- one to four times every one to fourteen days after the first immunization. Antibody- 
producing cells are obtained from the mammal so immunized in about one to five days after 
the last immunization. 

[00131] Hybridomas that secrete a monoclonal antibody can be prepared by the 

15 method of Kohler and Milstein (Nature, 256:495-497 (1975)) and by its modified method. 
Namely, hybridomas are prepared by fusing antibody-producing cells contained in a spleen, 
lymph node, bone marrow, or tonsil obtained from the non-human mammal immunized as 
mentioned above, preferably a spleen, with myeloma cells without autoantibody-producing 
ability, which are derived from, preferably, a mammal such as mice, rats, guinea pigs, 

20 hamsters, rabbits, or humans, or more preferably, mice, rats, or humans. 

[00132] For example, mouse-derived myeloma P3/X63-AG8.653 (653; ATCC No. 
CRL1580), P3/NSI/l-Ag4-l (NS-1), P3/X63-Ag8.Ul (P3U1), SP2/0-Agl4 (Sp2/0, Sp2), 
PAI, F0, or BW5147; rat-derived myeloma 210RCY3-Ag.2.3.; or human-derived myeloma 
U-266AR1, GM1500-6TG-A1-2, UC729-6, CEM-AGR, D1R11, or CEM-T15 can be used as 

25 a myeloma used for the cell fusion. 

[00133] Hybridoma clones producing monoclonal antibodies can be screened by 
cultivating the hybridomas, for example, in microtiter plates and by measuring the reactivity 
of the culture supernatant in the well in which hybridoma growth is observed, to the 
immunogen used for the immunization mentioned above, for example, by an enzyme 

30 immunoassay such as RIA and ELIS A. 
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[00134] The monoclonal antibodies can be produced from hybridomas by cultivating 
the hybridomas in vitro or in vivo such as in the ascites of mice, rats, guinea pigs, hamsters, 
or rabbits, preferably mice or rats, more preferably mice and isolating the antibodies from the 
resulting the-culture supernatant or ascites fluid of a mammal. 
5 [00135] In vitro cultivation can be performed depending on the property of cells to be 
cultured, on the object of a test study, and on various culture, by using known nutrient media 
or any nutrient media derived from known basal media for growing, maintaining, and storing 
the hybridomas to produce monoclonal antibodies in the culture supernatant. 
[00136] Examples of basal media are low calcium concentration media such as 

10 Ham'F12 medium, MCDB153 medium, or low calcium concentration MEM medium, and 
high calcium concentration media such as MCDB104 medium, MEM medium, D-MEM 
medium, RPMI1640 medium, ASF104 medium, or RD medium. The basal media can 
contain, for example, sera, hormones, cytokines, and/or various inorganic or organic 
substances depending on the objective. 

1 5 [00137] Monoclonal antibodies can be isolated and purified from the culture 
supernatant or ascites mentioned above by saturated ammonium sulfate precipitation, 
euglobulin precipitation method, caproic acid method, caprylic acid method, ion exchange 
chromatography (DEAE or DE52), affinity chromatography using anti-immunoglobulin 
column or protein A column. 

20 [00138] Furthermore, monoclonal antibodies can be obtained in a large quantity by 

cloning a gene encoding a monoclonal antibody from the hybridoma, generating transgenic 
bovines, goats, sheep, or pigs in which the gene encoding the antibody is integrated in its 
endogenous gene using transgenic animal generating technique, and recovering the 
monoclonal antibody derived from the antibody gene from milk of the transgenic animals 

25 (Nikkei Science, No.4, pp.78-84 (1997)). 

[00139] The "chimeric antibody" of the present invention means a monoclonal 
antibody prepared by genetic engineering, and specifically, a chimeric monoclonal antibody, 
for example, mouse/human chimeric antibody, whose variable region is a mouse 
immunoglobulin-derived variable region and whose constant region is a human 

30 immunoglobulin-derived constant region. 



37 



[00140] The constant region derived from human immunoglobulin has the amino acid 
sequence inherent in each isotype such as IgQ IgM, IgA, IgD, IgE, etc. The constant region 
of the recombinant chimeric monoclonal antibody of the present invention can be that of 
human immunoglobulin belonging to any isotype. Preferably, it is the constant region of 
human IgG 

[00141] The chimeric monoclonal antibody of the present invention can be produced, 
for example, as follows. Needless to say, the production method is not limited thereto. 
[00142] For example, mouse/human chimeric monoclonal antibody can be prepared, 
by referring to Experimental Medicine: SUPPLEMENT, 1.6, No. 10 (1988); and Examined 
Published Japanese Patent Application (JP-B) No. Hei 3-73280. Namely, it can be prepared 
by ligating CH gene (C gene encoding the constant region of H chain) obtained from the 
DNA encoding human immunoglobulin to the downstream of active VH genes (rearranged 
VDJ gene encoding the variable region of H chain) obtained from the DNA encoding mouse 
monoclonal antibody isolated from the hybridoma producing the mouse monoclonal 
antibody, and by ligating the C L gene (C gene encoding the constant region of L chain) 
obtained from the DNA encoding human immunoglobulin to the downstream of active V L 
genes (rearranged VJ gene encoding the variable region of L chain) obtained from the DNA 
encoding the mouse monoclonal antibody isolated from the hybridoma, and operably 
inserting those into the same or different vectors in an expressible manner, followed by 
transformation of host cells with the expression vector, and cultivation of the transformants. 
[00143] Specifically, DNAs are first extracted from mouse monoclonal antibody- 
producing hybridoma by the usual method, digested with appropriate restriction enzymes (for 
example, EcoRI and Hindlll), electrophoresed (using, for example, 0.7% agarose gel), and 
analyzed by Southern blotting. After the electrophoresed gel is stained, for example, with 
ethidium bromide and photographed, the gel is given marker positions, washed twice with 
water, and soaked in 0.25 M HC1 for 15 minutes. Then, the gel is soaked in 0.4 N NaOH 
solution for 10 minutes with gentle stirring. The DNAs are transferred to a filter for 4 hours 
following the usual method. The filter is recovered and washed twice with 2 x SSC. After 
the filter is sufficiently dried, it is baked at 75°C for 3 hours, treated with 0.1 x SSC/0.1% 
SDS at 65°C for 30 minutes, and then soaked in 3 x SSC/0.1% SDS. The filter obtained is 
treated with prehybridization solution in a plastic bag at 65°C for 3 to 4 hours. 
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[00144] Next, P-labeled probe DNA and hybridization solution are added to the bag 
and reacted at 65 °C about 12 hours. After hybridization, the filter is washed under an 
appropriate salt concentration, reaction temperature, and time (for example, 2 x SSC/0.1% 
SDS, room temperature, 10 minutes). The filter is put into a plastic bag with a little 2 x SSC, 
and subjected to autoradiography after the bag is sealed. 

[00145] Rearranged VDJ gene and VJ gene encoding H chain and L chain of mouse 
monoclonal antibody respectively are identified by Southern blotting mentioned above. The 
region comprising the identified DNA fragment is fractionated by sucrose density gradient 
centrifugation and inserted into a phage vector (for example, Charon 4A, Charon 28, 
AEMBL3, XEMBL4, etc.). E. coli (for example, LE392, NM539, etc.) are transformed with 
the phage vector to generate a genomic library. The genomic library is screened by plaque 
hybridization such as the Benton-Davis method (Science, 196:180-182 (1977)) using 
appropriate probes (H chain J gene, L chain (k) J gene, etc.) to obtain positive clones 
comprising rearranged VDJ gene or VJ gene respectively. By making the restriction map and 
determining the nucleotide sequence of the clones obtained, it is confirmed that genes 
comprising the desired, rearranged V H (VDJ) gene or V L (VJ) gene have been obtained. 
[00146] Separately, human CH gene and human CL gene used for chimerization are 
isolated. For example, when a chimeric antibody with human IgGl is produced, Cyi gene is 
isolated as a C H gene, and Ck gene is also isolated as a C L gene, are isolated. These genes 
can be isolated from human genomic library with mouse Cyi gene and mouse Ck gene, 
corresponding to human Cyi gene and human Ck gene, respectively, as probes, taking 
advantage of the high homology between the nucleotide sequences of mouse 
immunoglobulin gene and that of human immunoglobulin gene. 

[00147] Specifically, DNA fragments comprising human Ck gene and an enhancer 
region are isolated from human X Charon 4 A Haelll- Alul genomic library (Cell, 15:1157- 
1174 (1978)), for example, using a 3 kb Hindlll-BamHI fragment from clone Igl46 (Proc. 
Natl. Acad. Sci. USA, 75:4709-4713 (1978)) and a 6.8 kb EcoRI fragment from clone 
MEP10 (Proc. Natl. Acad. Sci. USA, 78:474-478 (1981)) as probes. In addition, for 
example, after human fetal hepatocyte DNA is digested with Hindlll and fractioned by 
agarose gel electrophoresis, a 5.9 kb fragment is inserted into X788 and then human Cyi gene 
is isolated with the probes mentioned above. 
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[00148] Using_a mouse Vh gene, mouse Vl gene, human Ch gene, and human Cl gene 
so obtained, and taking the p romoter region and enhancer region into consideration, human 
Ch gene is inserted downstream of mouse V H gene and human Cl gene is inserted 
downstream of mouse V L gene in an expression vector such as pSV2gpt or pSV2neo with 
5 appropriate restriction enzymes and DNA ligase following the usual method. In this case, 
chimeric genes of mouse V H gene/human C H gene and mouse V L gene/human Cl gene can be 
respectively inserted into a -the same or a_different expression vector. 

[00149] Chimeric gene-inserted expression vector(s) thus prepared are introduced into 
myeloma cells (e.g., P3X63 Ag8 653 cells or SP210 cells) that do not produce antibodies by 

10 the protoplast fusion method, DEAE-dextran method, calcium phosphate method, or 

electroporation method. The transformants are screened by cultivating them in a medium 
containing a drug corresponding to the drug resistance gene inserted into the expression 
vector and, then, cells producin g the desired chimeric monoclonal antibodies are obtained. 
[00150] Desired chimeric monoclonal antibodies are obtained from the culture 

15 supernatant of antibody-producing cells thus screened. 

[00151] The "humanized antibody (CDR-grafted antibody)" of the present invention is 
a monoclonal antibody prepared by genetic engineering and specifically means a humanized 
monoclonal antibody wherein a portion or the whole of the complementarity determining 
regions of the hyper-variable region are derived from those of the hyper-variable region from 

20 mouse monoclonal antibody, the framework regions of the variable region are derived from 
those of the variable region from human immunoglobulin, and the constant region is derived 
from that from human-immunoglobulin. 

[00152] The complementarity determining regions of the hyper- variable region exists 
in the hyper- variable region in the variable region of an antibody and means three regions 
25 which directly binds, in a complementary manner, to an antigen (complementarity- 
determining residues, CDR1, CDR2, and CDR3). The framework regions of the variable 
region mean four comparatively conserved regions intervening upstream, downstream or 
between the three complementarity-determining regions (framework region, FR1, FR2, FR3, 
and FR4). 

30 [00153] In other words, a humanized monoclonal antibody means that in which the 
whole region except a portion or the whole region of the complementarity determining 
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regions of the hyper-variable region of a mouse monoclonal antibody has been replaced with 
their corresponding regions derived from human immunoglobulin. 

[00154] The constant region derived from human immunoglobulin has the amino acid 
sequence inherent in each isotype such as IgQ IgM, IgA, IgD, and IgE. The constant region 
5 of a humanized monoclonal antibody in-of the present invention can be that from human 

immunoglobulin belonging to any isotype. Preferably it is the constant region of human IgG 
The framework regions of the constant region derived from human immunoglobulin are not 
particularly limited. 

[00155] The humanized monoclonal antibody of the present invention can be 
10 produced, for example, as follows. Needless to say, the production method is not limited 
thereto. 

[00156] For example, a recombinant humanized monoclonal antibody derived from 
mouse monoclonal antibody can be prepared by genetic engineering, referring to Published 
Japanese Translations of PCT International Publication No. Hei 4-506458 and Unexamined 

15 Published Japanese Patent Application (JP-A) No. Sho 62-296890. Namely, at least one 
mouse H chain CDR gene and at least one mouse L chain CDR gene corresponding to the 
mouse H chain CDR gene are isolated from hybridomas producing mouse monoclonal 
antibody, and human H chain gene encoding the whole region except human H chain CDR 
corresponding to mouse H chain CDR mentioned above and human L chain gene encoding 

20 the whole region except human L chain CDR corresponding to mouse L chain CDR 
mentioned above are isolated from human immunoglobulin genes. 

[00157] The mouse H chain CDR gene(s) and the human H chain gene(s) so isolated 
are inserted, in an expressible manner, into an appropriate vector so that they can be 
expressed. Similarly, the mouse L chain CDR gene(s) and the human L chain gene(s) are 

25 inserted, in an expressible manner, into another appropriate vector so that they can be 

expressed. Alternatively, the mouse H chain CDR gene(s)/human H chain gene(s) and mouse 
L chain CDR gene(s)/human L chain gene(s) can be inserted, in an expressible manner, into 
the same expression vector so that they can be expressed. Host cells are transformed with the 
expression vector thus prepared to obtain transformants producing humanized monoclonal 

30 antibody. By cultivating the transformants, desired humanized monoclonal antibody is 
obtained from culture supernatant. 
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[00158] The "human antibody" used in the present invention is immunoglobulin in 
which the entire regions comprising the variable and constant region of the H chain, and the 
variable and constant region o f the L chain constituting immunoglobulin are derived from the 
genes encoding human immunoglobulin. 
5 [00159] The human antibody can be produced in the same way as the production 
method of polyclonal or monoclonal antibodies mentioned above by immunizing, with an 
antigen, a transgenic animal which for example, at least human immunoglobulin gene(s) have 
been integrated into the locus of a non-human mammal such as a mouse by the usual method. 
[00160] For example, a transgenic mouse producing human antibodies is prepared by 
10 the methods described in already published literatures (Nature Genetics, 7: 1 3-21 (1994); 
Nature Genetics, 15:146-156 (1997); JP-WAHei 4-504365; W094/25585; Nikkei Science, 
No.6, pp.40-50 (1995); W094/25585; Nature, 368:856-859 (1994); JP-WANo. Hei 6- 
500233). 

[00161] The "portion of an antibody" used in the present invention means a partial 

15 region of the antibody, and preferably the monoclonal antibody of the present invention as 
mentioned above, and specifically, means F(ab') 2 , Fab', Fab, Fv (variable fragment of 
antibody), sFv, dsFv (disulfide stabilized Fv), or dAb (single domain antibody) (Exp. Opin. 
Ther. Patents, 6, No.5, pp.441-456 (1996)). 

[00162] "F(ab') 2 " and "Fab"' can be produced by treating immunoglobulin 
20 (monoclonal antibody) with a protease such as pepsin and papain, and means an antibody 
fragment generated by digesting immunoglobulin near the disulfide bonds existing between 
the hinge regions in each of the two H chains. For example, papain cleaves IgG upstream of 
the disulfide bonds existing between the hinge regions in each of the two H chains to 
generate two homologous antibody fragments in which an L chain composed of V L (L chain 
25 variable region) and C L (L chain constant region), and an H chain fragment composed of V H 
(H chain variable region) and C H yl (yl region in the constant region of H chain) are 
connected at their C terminal regions through a disulfide bond. Each of these two 
homologous antibody fragments is called Fab'. Pepsin also cleaves IgG downstream of the 
disulfide bonds existing between the hinge regions in each of the two H chains to generate an 
30 antibody fragment slightly larger than the fragment in which the two above-mentioned Fab' 
are connected at the hinge region. This antibody fragment is called F(ab') 2 . 



[00163] The "cell producing a monoclonal antibody reactive to a protein or a fragment 
thereof of the present invention means any cell producing the above-described monoclonal 
antibody of the present invention. 

[00164] More specifically, the following is included: 

(1) B cells that are obtained by immunizing the non-human mammals with the 
above-mentioned protein of the present invention, a fragment thereof, or the cells 
producing the protein and that produce a monoclonal antibody reactive to the protein 
of the present invention or a fragment thereof. 

(2) The above-mentioned hybridomas (fused cell) prepared by fusing the thus- 
obtained B cells producing the antibody with myeloma cells derived from mammals. 

(3) Monoclonal antibody-producing transformants obtained by transforming cells 
other c e lls than the monoclonal antibody-producing B cells and hybridomas with 
genes encoding the monoclonal antibody isolated from the monoclonal antibody- 
producing B cells or hybridomas (either the heavy chain-encoding gene or the light 
chain-encoding gene, or both). . 

The monoclonal antibody-producing transformants of (3) mean recombinant 
cells producing a recombinant monoclonal antibody produced by B cells of (1) or 
hybridomas of (2). These antibody producing-transformants can be produced by the 
method as used for producing the above-described chimeric monoclonal antibody and 
humanized monoclonal antibody. 
[00165] The "pharmaceutical composition" used herein means a pharmaceutical 
composition comprising of any of the protein, fragment thereof, antibody, or portion thereof 
defined hereinabove, and a pharmaceutically acceptable carrier. 

[00166] The "pharmaceutically acceptable carrier" includes an excipient, a diluent, an 
expander, a disintegrating agent, a stabilizer, a preservative, a buffer, an emulsifier, an 
aromatic, a colorant, a sweetener, a viscosity-increasing agent, a flavor, a dissolving agent, or 
other additives. Using one or more of such carriers, a pharmaceutical composition can be 
formulated into tablets, pills, powders, granules, injections, solutions, capsules, troches, 
elixirs, suspensions, emulsions, or syrups. The pharmaceutical composition can be 
administered orally or parenterally. Other forms for parenteral administration include a 
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solution for external application, suppository for rectal administration, and pessary, 
prescribed by the usual method, which comprises one or more active ingredient. 
[00167] The dosage can vary depending on the age, sex, weight, and symptoms of a 
patient, effect of treatment, administration route, period of treatment, or the kind of active 
5 ingredient (protein or antibody mentioned above) contained in the pharmaceutical 

composition. Usually, the pharmaceutical composition can be administered to an adult in a 
dose of 10 fig to 1000 mg (or 10 jig to 500 mg) per one administration. Depending on 
various conditions, the lower dosage may be sufficient in some cases, and a higher dosage 
may be necessary in other cases. 

10 [00168] In particular, the injection can be produced by dissolving or suspending the 
antibody in a non-toxic, pharmaceutically acceptable carrier such as physiological saline or 
commercially available distilled water for injections by adjusting the concentration to 0.1 jig 
antibody/ml carrier to 10 mg antibody/ml carrier. The injection thus produced can be 
administered to a human patient in need of treatment in a dose of 1 \xg to 100 mg/kg body 

15 weight, preferably 50 jig to 50 mg/kg body weight, once or more times a day. Examples of 
administration routes are medically appropriate administration routes such as intravenous 
injection, subcutaneous injection, intradermal injection, intramuscular injection, or 
intraperitoneal injection, preferably intravenous injection. 

[00169] The injection can also be prepared into a non-aqueous diluent (for example, 
20 propylene glycol, polyethylene glycol, vegetable oil such as olive oil, and alcohols such as 
ethanol), suspension, or emulsion. 

[00170] The injection can be sterilized by filtration with a bacteria-non-penetrable 
filter, by mixing bactericide, or by irradiation. The injection can be prepared at the time of 
use. Namely, it is freeze-dried to make a sterile solid composition, and can be dissolved in 

25 sterile distilled water for injection or another solvent before use. 

[00171] The pharmaceutical composition of the present invention is useful as a drug 
for preventing and treating, for example, primary immunodeficiency syndrome with 
congenital disorder of immune system, mainly immunodeficiency considered to develop by 
B lymphocyte deficiency, decrease, or dysfunction (e.g., sex-linked agammaglobulinemia, 

30 sex-linked agammaglobulinemia with growth hormone deficiency, immunoglobulin 
deficiency with high IgM level, selective IgM deficiency, selective IgE deficiency, 
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immunoglobulin heavy chain gene deletion, k chain deficiency, IgA deficiency, IgG subclass 
selective deficiency, CVID (common variable immunodeficiency), infantile transient 
dysgammaglobulinemia, Rosen syndrome, severe combined immunodeficiency (sex-linked, 
autosomal recessive), ADA (adenosine deaminase) deficiency, PNP (purine nucleoside 
5 phosphorylase) deficiency, MHC class II deficiency, reticular dysplasia, Wiskott-Aldrich 
syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal aberration, familial Ig 
hypermetabolism, hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, Good 
syndrome, osteodystrophy, transcobalamin syndrome, secretory bead syndrome, etc.), various 
diseases with antibody production deficiency that are secondary immunodeficiency 

10 syndrome with disorder of immune system caused by an acquired etiology (for example, 
AIDS, etc.), and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis, 
conjunctivitis, allergic rhinitis, allergic enteritis, drug-induced allergy, food allergy, allergic 
urticaria, glomerulonephritis, etc.), and for r e living relieving conditions due to various 
immunod e ficiency immunodeficiencies associated with the diseases. 

1 5 [00172] The DNA of the present invention described above, namely, "DNA 

comprising any partial nucleotide sequence of SEQ ID NO:7, from SEQ ID NO:9 to SEQ ID 
NO:15,_or SEQ ID NO:35, those with partial chemical modification, DNA comprising 
complementary nucleotide sequences to the partial sequence, or those with partial chemical 
modification" are included. 

20 [00173] Here, the "partial nucleotide sequence" means the partial nucleotide sequence 
comprising any number of bases at any region included in any nucleotide sequence listed in 
SEQ ID NO:7, from SEQ ID NO:9 to SEQ ID NO: 15, or SEQ ID NO:35. 
[00174] The DNA is useful as probes in DNA hybridization or RNA hybridization 
procedures. fe- For the purpose of using the DNA as a probe, continuous nucleotide 

25 sequences of over 20 bases, preferably continuous nucleotide sequences of over 50 bases, 
more preferably over 100 bases, much more preferably over 200 bases, especially preferably 
over 300 bases, can be-listed used as the partial nucleotide sequences. 
[00175] Also, the DNA described abeve- above, as mentioned before, are useful as 
primers for PCR. fe -For the purpose of using the DNA as PCR primers, continuous partial 

30 nucleotide sequences of from 5 to 100 bases, preferably from 5 to 70 bases, more preferably 
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from 5 to 50 bases, much more preferably from 5 to 30 bases, can be listed- used as the partial 
nucleotide sequences. 

[00176] Moreover, the DNA described above are useful as antisense drug. The DNA, 
by hybridizing to a DNA or an RNA encoding the AID protein of the present invention, can 
5 inhibit transcription of the DNA to mRNA or translation of the mRNA into the protein. 
[00177] fe -For the p urpose of using above-mentioned DNA to antisense drug, the 
partial nucleotide sequence consists of 5 to 100 consecutive nucleotides, preferably 5 to 70 
consecutive nucleotides, more preferably 5 to 50 consecutive nucleotides, and still more 
preferably 5 to 30 consecutive nucleotides. 

1 0 [001 78] When the DNA is used as an antisense DNA pharmaceutical, the DNA 
sequence can be modified chemically in part for extending the half-life (stability) of the 
blood concentration of the DNA administered to patients, for increasing the intracellular- 
membrane permeability of the DNA, or for increasing the degradation resistance or the 
absorption of the orally administered DNA in the digestive organs. The chemical 

15 modification includes, for example, the modification of the phosphate bonds, the riboses, the 
nucleotide bases, the sugar moiety, the 3' end and/or the 5' end in the structure of the 
oligonucleotide DNA. 

[00179] The modification of phosphate bonds includes, for example, the conversion of 
one or more of the bonds to phosphodiester bonds (D-oligo), phosphorothioate bonds, 

20 phosphorodithioate bonds (S-oligo), methyl phosphonate (MP-oligo), phosphoroamidate 
bonds, non-phosphate bonds or methyl phosphonothioate bonds, or combinations thereof. 
The modification of the ribose includes, for example, the conversion to 2 , -fluororibose or 2'- 
O-methylribose. The modification of the nucleotide base includes, for example, the 
conversion to 5-propynyluracil or 2-aminoadenine. 

25 [00180] Also, another ana- embodiment of the present invention relates to "methods of 
identifying substances regulating the production of the AID protein of the present invention 
or the transcription of the gene encoding AID protein to mRNA.": The method of the present 
invention is namely "the method of screening of drugs capable of regulating functions of AID 
protein or AID gene^ 

30 [00181] As the cells us e d in the method of the present invention, any cells, as long as 
capable of producing AID protein of the present invention, can be used. For instance, native 
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cells (preferably of mouse or human), transgenic cells transformed with a gene encoding an 
AID protein of the present invention, cells introduced with RNA encoding an AID protein of 
the present invention, etc., can be fetedr used. 

[00182] As the host cells us e d for preparing the transgenic cells, various cells, 
5 mentioned in the part explaining in detail en-the methods of expressing the protein of the 
present invention using the DNA of the protein described above, can be used. 
[00183] For instance, various cells such as naturally established cells or artificially 
established transgenic cells (e.g. bacteria {Escherichia, Bacillus), yeast (Saccharomyces, 
Pichia), animal cells and insect cells) can be exemplified. 

10 [00184] Preferably, animal cells, namely, cells derived from mouse (COP, L, C127, 
Sp2/0, NS-1, or NIH3T3, etc.), cells derived from rat (PC12, PC12h, etc.), cells derived from 
hamster (BHK, and CHO, etc.), cells derived from monkey (COS1, COS3, COS7, CV1, and 
Velo, etc.), and cells derived from human (Hela, cells derived from diploid fibroblast, 
HEK293 cells, myeloma cells, and Namalwa, etc.) can be exemplified. 

15 [00185] The "substance" in the present invention means natural substance existing in 
the nature and any substance prepared artificially. The substances can be grouped into 
"peptidic substance" and "non-peptidic substance/V 

[00186] As the "non-peptidic substance/ \ "DNA comprising partial nucleotide 
sequence, or chemically modified DNA derived from it" that are useful as antisense drugs as 

20 described above, "antisense RNA" with similar structural and pharmacological property to 
the antisense DNA, or any chemically synthesized "compounds" can be exemplified. The 
"compounds" h e r e in means compounds e xc e pting DNA, RNA, and p e ptidic substanc e s. 
Namely, compounds with mol e cular weight of smaller than from 100 to approximately 1000, 
pr e ferably compounds with molecular w e ight of from 100 to 800, mor e pr e f e rably molecular 

25 w e ight of from 100 to 600, can b e e x e mplified. Examples of "compounds" are compounds 
other than DNA, RNA, and the above-mentioned peptidic substances, which have a 
molecular weight from approximately 100 to approximately 1000, preferably from 
approximately 100 to approximately 800, more preferably from approximately 100 to 
approximately 600. 

30 [00187] As the "peptidic substance/'^ antibodies already described above in detail 
(preferably monoclonal antibodies, more preferably recombinant antibodies or human 



47 



monoclonal antibodies), oligopeptides, or chemically modified substance derived from them 
can be exemplified. Examples of an oligopeptide are a peptide comprising 5 to 30 amino 
acids, preferably 5 to 20 amino acids. The chemical modification can be designed depending 
on various purposes, for example, the-for increased half-life in blood in the case of 
5 administering in vivo, or the-for increased tolerance against the-degradation or increased 
absorption in digestive tract at th e after oral administration. 

[00188] Methods described in from (24) to (28) above includes so-called reporter gene 
assays, as one of the method of the present invention. 

[00189] As the "reporter protein 4 " ? luciferase derived from firefly or sea pansy, or GFP 

1 0 derived from i e rryfis h iellyfish , are preferred. 

[00190] As the "reporter gene assay^ methods described below are representative. 
[00191] Transgenic cells are generated by transforming cells commonly used in the 
production of recombinant proteins with expression vector, in which a gene encoding the 
target protein and a gene encoding the reporter protein are inserted intote the vector so that 

1 5 the transcription of the gene encoding the reporter protein- to mRNA occurs dep e nd e ntly 
dep e nd e ns -en- is induced by the signal of the transcription of the gene of target protein to 
mRNA. The test substances (described above) are applied to the obtained transformant cells. 
Analysis that whether the compound affects the expression of transporter molecule can be 
accomplished by measuring the level of the target protein by indirect m e asuring 

20 measurement of the amount of fluorescence emitted by the reporter protein expressed in 
parallel with the target protein (for reference, see U.S. Pat. No. 5,436,128 and U.S. Pat. No. 
5,401,629). 

[00192] Also, t The identification of the compounds using the present assay can be 
accomplished by manual operation, but it can also be readily and simply done automatically 
25 by using-so-called High-Throughput Screening using robots (SOSHIKI BAIYO KOUGAKU, 
23:521-524; U.S. Pat. No. 5,670,113). 

[00193] The -The terms "cells" and "substances" used in the methods described above 
contain have the same meaning as defined above. 

[00194] The substances identified by the methods of the present invention are very 
30 useful as drugs for the therapy of various diseases considered to be caused by the 

hyperfunction or deficiency of the AID protein of the present invention or by the deficiency 
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or mutation of the AED gene, or for remission of various symptoms sup e rv e n e associated 
with the diseases. 

DESCRIPTION OF DRAWINGS 

FIG 1 is a4s-the - pair of p hotographs which shows the production state-of DNA 
5 including an Sa sequence looped out with th eby class switch recombination in mouse B cell 
clone CH12F3-2 cultured under the-various conditions. 

FIG. 1(a) shows the e l e ctrophor e tic electrophoresiss tate of DNA including 11111 
sequence looped out wkh-bv class switch recombination, amplified by PCR using DNA 
derived from mouse B cell clone CH12F3-2 cultured under-the various conditions. 

10 Lanes 1 and 6 show the e l e ctrophor e tic electrophoresis state of marker DNAs. _Lane 

2 shows the e l e ctrophoretic electrophoresis s tate-of PCR product s produced using DNA from 
cells cultured in the-conditions excluding IL-4, CD40L, TGF p e^and_cycloheximide ? as a 
template. _Lane 3 shows the e lectrophorotic electrophoresis s tate-of BNA -PCR p roducts 
produced using DNA from cells cultured in the presence of cycloheximide only, as a 

15 template. _Lane 4 shows the electrophoresis^ state-of PCR product s produced using DNA 
from cells cultured in the presence of IL-4, CD40L and TGF p T as a template.. Lane 5 shows 
the e l e ctrophor e tic electrophoresis state-of PCR product s produced using DNA from cells 
cultured in the presence of IL1-4, CD40L, TGF p, and cycloheximide^ as a template. 

FIG 1(b) shows the result of Southern hybridization fer-of DNA including an Sa 

20 sequence looped out with-bv class switch recombination, amplified by PCR using DNA 
derived from mouse B cell clone CH12F3-2 cultured under the-various conditions. 

Lane 1 shows the result of hybridization agams^of PCR product s produced using 
DNA from cells cultured in the-conditions excluding any-ef IL-4, CD40L, TGF p e^-and 
cycloheximide^ as a template. _Lane 2 shows the result of Southern hybridization against of 

25 PCR product s produced using DNA from cells cultured in the presence of cycloheximide 
only 7 as a template. _Lane 3 shows the result of hybridization agamstof PCR products 
produced using DNA from cells cultured in the presence of IL-4, CD40L, and TGF p 7 as a 
template-^Lane 4 shows the result of hybridization against PCR product s produced using 
DNA from cells cultured in the presence of IL-4, CD40L, TGF p and cycloheximide^ as a 

30 template. 
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FIG. 2 is a pair of p hotographs showing the production state-of DNA including an Sa 
sequence looped put wit h out by class switch recombination, amplified by PCR using DNA 
derived from mouse B cell clone CH12F3-2 cultured in various conditions. 

FIG 2(a) shows the e lectrophor c tic electrophoresis s tate-of DNA from DNA 
5 including an Sa sequence looped out wtih-by class switch recombination in mouse B cell 
clone CH12F3-2 cultured in the-various conditions, stained with ethidium bromide. 

Lanes 1 and 6 show the e l e ctrophor e tic electrophoresis s tate-of marker DNAs. _Lane 
2 shows the e l e ctrophor e tic electrophoresis state-of PCR products produced using DNA from 
cells cultured in the-conditions excluding IL-4, CD40L, TGF (3-e r and cycloheximide^ as a 

10 template.. Lane 3 shows the e l e ctrophoretic electrophoresis state of DNA rPCR p roducts 
producing using DNA from cells cultured in the presence of cycloheximide only, as a 
template. _Lane 4 shows the e l e ctrophor e tic electrophoresis stat e for of PCR products 
produced using DNA from cells cultured in the presence of IL-4, CD40L and TGF p T as a 
template. _Lane 5 shows the e l e ctrophor e tic electrophoresis stat e for of PCR products 

1 5 produced using DNA4eFm- from cells cultured in the presence of 11-4, CD40L, TGF P-, and 
cycloheximide^ as a template. 

FIG 42(b) shows the result of Southern hybridization fe^of DNA including an Sa 
sequence looped out with class switch recombination, amplified by PCR using DNA derived 
from mouse B cell clone CH12F3-2 cultured under the-various conditions. 

20 Lane 1 shows the result of hybridization against ofa-PCR product s produced using 

DNA from cells cultured under the-conditions excluding any on e of . IL-4, CD40L, TGF p or 
and_cycloheximide, as a template. _Lane 2 shows the result of hybridization against a-PCR 
product s produced using DNA from cells cultured in the presence of cycloheximide only T _ as 
a template.. Lane 3 shows the result of hybridization against a-PCR product s produced using 

25 the-DNA from cells cultured in the presence of IL-4, CD40L, and TGF p ? as a template. 

Lane 4 shows the result of hybridization against arPCR product s produced using DNA from 
cells cultured in the presence of IL-4, CD40L, TGF P and cycloheximide^ as a template. 

FIG 3 shows the result of Northern blotting using a radiolabeled cDNA fragment 
encoding a radiolabel e d 23C9 (AID) protein as a probe , against mRNA derived from mouse 

30 B cell clone CH12F3-2 cultured under the various conditions. 
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Lane 1 shows the result of blotting against mRNA from cells cultured in the 
conditions excluding any on e of IL-4, CD40L, e^TGF p e ^and cycloheximide. JLane 2 
shows the result of blotting against mRNA from cells cultured in the presence of 
cycloheximide ? only. _Lane 3 shows the result of blotting against mRNA from cells cultured 
5 in the presence of IL-4, CD40L i and TGF p. Lane 4 shows the result of blotting against 
mRNA from cells cultured in the presence of IL-4, CD40L, TGF p, and cycloheximide. 

FIG. 4 shows the result of Northern blotting using_a radio-labeled cDNA fragment 
coding 23C9 (AID) protein as a probe against mRNA derived from mouse B cell clone 
CH12F3-2 cultured in-the various conditions. 
10 Lane 1 shows the result of blotting against mRNA from cells cultured in the 

conditions excluding IL-4, CD40L, TGF p er -and cycloheximideT Lane 2 shows the result of 
blotting against mRNA from cells cultured in the presence of cycloheximide, only. _Lane 3 
shows the result of the-blotting against mRNA ferm- from cells cultured in the presence of IL- 
4, CD40L, and TGF p. _Lane 4 shows the result of the-blotting against mRNA ferm-from 
15 cells cultured in the presence of IL-4, CD40L, TGF p^ and cycloheximide. 

FIG 5 shows the homology between an amino acid sequence of mouse AID protein 
(SEP ID NO^ and that of mouse APOBEC-1 (SEQ ID NO:36). 

An amino acid in a closed box shows an identical amino acid. _A region in an open 
box indicates a cytidine deaminase motif. _An amino acid with an asterisk (*) or an arrow 
20 indicates an amino acid conserved among APOBEC-1 proteins derived from rat, mouse, 
rabbit, and human. 

FIG 6 shows a phylogenic tree of various enzymes belonging to a cytosine nucleoside 
/-nucleotide deaminase family, prepared based on cytidine deaminase motif. 

FIG 7 shows a photograph indicating the electrophoretic state for AID-GST fusion 
25 protein in the molecular weight analysis by the gel electrophoresis and silver staining 
method. 

Lane 1 shows the electrophoretic state for a marker molecule. _Lane 2 shows the 
electrophoretic state for various proteins included in extracts from wild type Escherichia coli 
DH 5a. _Lane 3 shows the electrophoretic state for purified AID-GST fusion protein. 
30 FIG 8 shows the electrophoretic state for AID-GST fusion protein by Western 

blotting using anti-AID protein peptide antibody. 



Lane 1 shows the electrophoretic state for various proteins included in the extract 
from wild type E. coli DH5a. 

Lane 2 shows the electrophoretic state for purified AID-GST protein. 
FIG. 9 shows a cytidine deaminase activity depending on the concentrations of AID 
5 proteins. 

FIG 10 shows the inhibitory effect of tetrahydrouridine which is an inhibitor specific 
to cytidine deaminase on a cytidine deaminase activity in ATP protein. 

FIG 11 shows the inhibitory effect of each of 1,10-o-phenanthrolime which is a zinc- 
chelating agent, and 1,7-o-phenanthroline which is an inactivated isomer thereof on a 
1 0 cytidine deaminase activity in AID protein. 

FIG 12 is a photograph indicating expression state for mRNA of AID in various 
tissues in mouse, analyzed by Northern blotting method. 

FIG. 13 is a photograph indicating the expression state for mRNA of AID in various 
lymphatic tissues in mouse, analyzed by RT-PCR method. 
15 FIG 14 is the photograph showing expression state for mRNA of AID as -over time 

gees, in activated mouse B cell clone CH12F3-2, analyzed by Northern blotting method. 

FIG 15 a photograph showing expression state for mRNA of AID in mouse B cell 
clone CH12F3-2 stimulated with cytokine in various combinations, analyzed by Northern 
blotting. 

20 FIG 16 shows a photograph indicating expression state for mRNA of AID in mouse 

spleen B cells, stimulated with stimulants in various combinations, analyzed by Northern 
hybridization method. 

FIG. 17 is a photograph indicating expression state for mRNA of AID in splenocytes 
derived from mice immunized with sheep red blood cells, analyzed by Northern blotting 

25 analysis. 

FIG 18 shows expression state for mRNA of AID in splenocytes derived from mice 
immunized with sheep red blood cells, analyzed by RT-PCR. 

FIG. 19 is a photograph indicating localization of expression for AID mRNA in 
splenocytes derived from a normal mouse or a mouse immunized by sheep red blood cells, 
30 specifically, analyzed by in situ hybridization. 
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FIGs. 19 (A) and (D) indicate the result in the hybridization using a sense AID probe. 
FIGs. 19 (B) and (E) show localization for AID mRNA expression in hybridization using an 
antisense-AID probe. FIGs. 19 (C) and (F) show localization of germinal center in staining 
test by FITC-labeled PNA. FIGs 19 (A), (B), and (C) indicate the result in the test using 
5 spleen tissues derived from normal mouse (before the immunization of sheep red blood 
cells). FIGs. 19 (D), (E), and (F) show the results of the examination using spleen tissue 
slices prepared 5 days after immunizing a mouse with sheep red blood cells. 

FIG 20 is a photograph showing the localization of expression for ATP mRNA in 
spleen tissue and payer's patch tissue, respectively, derived from a normal mouse or from a 
10 mouse immunized with sheep red blood cells, respectively, analyzed by in situ hybridization. 

FIGs. 20 (A), (D), and (G) show the results in the hybridization using a sense AID 
probe. FIGs. 20 (B), (E), and (H) show the localization of the expression for AID mRNA in 
hybridization using an antisense AID probe. FIGs. 20 (C), (F), and (I) show the localization 
of germinating center in the staining test by FITC-labeled PNA. FIGs. 20 (A), (B), and (C) 
15 show the result of the examination using spleen tissues derived from a normal mouse (before 
immunization by sheep red blood cells). FIGs. 20 (D), (E), and (F) show the results of the 
examination using spleen tissue slices prepared 5 days after immunization of a mouse with 
sheep red blood cells. FIGs. 20 (G), (H), and (I) show the results of test using payer's patch 
prepared 5 days after the immunization of a mouse with sheep red blood cells. 
20 FIG 2 1 schematically shows relative locations of partial nucleotide sequences of 

human genomic DNA coding human AID protein, which was amplified by PCR using 
various pairs of primers. 

FIG. 22 schematically shows a degree of homology between an amino acid sequence 
of mouse AID protein -(SEP ID NO^) and that of human AID protein (SEO ID NO:8\ _The 
25 parts with a closed box are cytidine and deoxycytidylate deaminase zinc-binding region 
which is an AID protein active region. 

FIG. 23 schematically shows the structure of human genomic DNA including a gene 
coding human AID protein. _One to five shows exon 1, exon 2, exon 3, exon 4, and exon 5, 
respectively. 

30 FIG 24 is a photograph indicating the expression state for human AID mRNA in 

various types of human tissues, analyzed by RT-PCR. 
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FIG 25 is a photograph indicating a location (localization) of human AID gene on 
human chromosome, analyzed by Fluorescence in situ hybridization (FISH) method. 

Two points at the tips of arrows show 12pl3 A where the human AID gene e xists is 
located .T 

5 Like reference symbols in the various drawings indicate like elements. 

DETAILED DESCRIPTION OF THE INVENTION 
[00195] The invention is illustrated in details by the following Examples, butjs not 
restricted to theembodiments described in the Examples. 

10 [00196] Example 1 : Culture of fe^-mouse B cell clone CH12F3-2 and confirmation of 
properties 

[00197] Mouse B cell clone CH12F3-2 occurring that undergoes class switch 
recombination (CSR) from IgM to IgA, several hours after the-_stimulation by- with IL-4, 
TGF-p, and CD40L, previously isolated by the present inventors, was cultured in the same 
15 manner as in the previous report (Immunity, 9:1-10, 1998; Curr. Biol., 8:227-230, 1998; Int. 
Immunol, 8:193-201, 1996). 

[00198] When the-CH12F3-24s- cells were stimulated bv -with IL-4. TGF-p, and 
CD40L, a circular DNA including an S region (switch region) looped out by class switch 
recombination was detected several hours after the stimulation. 
20 [00199] The following manipulation was conducted according to the previously 
report ed method (Curr. Biol., 8:227-230, 1998). 

[00200] The -CH12F3-2 B cells CH12F3 2 , either stimulated by IL-4, TGF-p, and 
CD40L_ol , and that which was not unstimulated., were cultured for 6 hours in the presence or 
absence of cycloheximide (200 ng/ml) which is a protein synthesis inhibitor, respectively. 

25 Genomic DNA was extracted from each cell, and PCR was conducted with the DNA as a 
template-by following the-standard methods to amplify circular DNA including an S/i 
sequence and an Sa sequence. _PCR was conducted byrusing a-pairs of primers, aFl and 
liRS^jgr and th e other FCR was conducted by using a pair of prim e rs, aFl and jaR3. 
[00201] As a control, genomic DNA encoding glyceraldehyde-3-phosphate 

30 dehydrogenase (GAPDH) was amplified by PCR. 
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[00202] The PCR products was- were subjected to the-gel electrophoresis by- and 
ethidium bromide staining._ FIG 1(a) and FIG. 2 (a) show the results. 
[00203] To confirm the presence or absence of the amplification of a circular DNA 
including the looped-out S region, Southern hybridization was conducted against the PCR 
5 product-sby using_a mouse Sa region gene fe^as a hybridization probe, according to-the 

standard methods (L. Sambrook E. R, Tom Maniatis., Second edition, Ed. Molecular Cloning 
(Nolan, C, Ed.) Cold Spring Harbor, 1989). . As an Sa g o n e , a A 1,155 bp DNA fragment 
obtained by digesting 10 kb EcoRI cleaved fragment IgH703 with Hind III and Earl was used 
asanStx gene (Genbank #D11468, DNA No. 1993-3148) (J, Biol. Chem., Vol. 268, p. 4651- 

10 4665). _FIGs. 1(b) and 2(b) show the results. 

[00204] It has been shown that mouse B cell CH2F3-2 produces the looped-out DNA 
containing the Sa sequence with the class switch recombination by-ihe- after stimulation with 
cytokines, and the production of the DNA is inhibited by the presence of cycloheximide. 
This result suggested that the occurrence of class switch recombination of an 

1 5 immunoglobulin gene n ee ds a nov e l synthesis of a protein requires de novo protein synthesis 
in the very early stages after the-stimulation a and the -that protein synthesis is deeply involved 
in the induction of-the class switch recombination . 

[00205] Example 2: Identification of a gene whieh -whose expression is improved in 

20 mouse B cell CH12F3-2 stimulated by cytokines^ 

[00206] A gene which is presumably expressed in the early stage after mouse B cell 
clone CH12F3-2 is stimulated, and presumably plays a role e£in introducing class switch 
recombination of an immunoglobulin gene,, was attempted to be isolated from the-CH12F3-2 
cells by the suppression subtract hybridization (SSH) (Proc. Natl. Acad. Sci. USA, 93:6025- 

25 6030, 1996; Anal. Biochem., 240:90-97, 1996) using the inhibitory PCR effect (Nucleic 
Acids Res., 23:1087-1088, 1995). 

[00207] A cDNA library necessary for subtraction cloning was prepared by-using_a 
PCR-Select Subtraction Kit (CLONTECH, Catalogue NO: Kl 804-1) by following the 
instruction manual suppl e m e nted supplied with the kit in th e e xp e rim e ntal manipulation . 
30 [00208] PolyA + -RNA was isolated from each of. mouse B cell clone CH12F3-2 
stimulated with IL-4, TGF-p and CD40L for 5 hours, the same cells stimulated with the 
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cytokines for 12 hours, and-tfee cells which were not stimulated, ^following the reported 
method (Nucleic Acids Res., 26:911-918, 1998) and treated with DNasel to eliminate any 
contaminating g enomic DNA which may bo mixed . Then cDNA was prepared based on each 
polyA + -RNA sample using reverse transcriptase according to the standard method. Each 
5 cDNA prepared from mouse B cell clone CH12F3-2, treated with the above cytokines for 5 
or 12 hours,, was mixed with sam e in equimolar m ele-amounts to be used as a tester cDNA. 
On the other hand, cDNA derived from unstimulated cells was used as a driver cDNA. 
[00209] Subtraction was conducted by adding the driver cDNA into the tester cDNA 
according to the abeve- above-referenced p revious report and the e xperimental manipulation 
10 kit instruction manual. The efficiency of subtraction was monitored by adding a small 
amount (1 : 1000 mole ratio) of 0_-X174 phage DNA cleaved at the restriction enzyme site 
Hae III, as a control, into the tester cDNA. After the subtraction, the phage DNA was 
concentrated to a mole ratio of about 100 times. 

[00210] The subtracted cDNA was cloned into the T- vector (Promega) according to the 
15 standard method to prepare a plasmid library. In the same manner as in the previous report, 
2000 colonies in the library were screened by the differential hybridization method (Nucleic 
Acids Res., 26:911-918, 1998; Medical immunity, 29:17, p. 451-459, 1997). Each of the 
above tester cDNA and driver cDNA was radiolabeled to be used for hybridization. Clones 
including O XI 74 phage DNA were selected by hybridizing O XI 74 phage DNA with a 
20 replicant filter. 

[00211] One hundred fifteen clones emitting a stronger signal than the radio-labeled 
driver cDNA probe against radio lab e led radiolabeled tester cDNA probe were identified 
and a nucleotide sequence of each clone was determined by using a DNA sequencer. 
[00212] Northern blotting was conducted against mRNA obtained from mouse B cell 

25 clone CH12F3-2 stimulated with IL-4, TGF-p and CD40L or the same cell lines 

unstimulated, using the radio-labeled DNA inserted into the each clone as a probe, according 
to the standard method (L. Sambrook, E. R, Tom Maniatis., Second edition, Ed. Molecular 
Cloning (Nolan, C, Ed.), Cold Spring Harbour, 1989). As a result, the enhanced expression 
corresponding to the stimulation with the above cytokines was observed in 23 among o ut of 

30 115 clones. Gene fragments coding 7 different types of proteins, including genes coding the 
3 kinds of known proteins and 4 kinds of novel proteins were found to be inserted into the 23 



clones. .Specifically, the expression of the 7 kinds of genes were-was found to be enhanced in 
mouse B cell clone CH12F3-2 by the stimulation with IL-4, TGF-0 and CD40L. 

[00213] <The known proteins> 
5 ABCD-1/MDC (8 clones) 
IFNgy receptor (2 clones) 
I-a(MHC class II) (1 clone) 

[00214] <Novel proteins> 
10 23C9 (3 clones) 
15B11 (7 clones) 
8B9 (1 clone) 
16A9 (1 clone 

15 [00215] As it has been known that the expression of the above I-a gene and 

ABCD/MDC gene is enhanced by stimulating mouse spleen B cell with IL-4 and CD40L, it 
was confirmed that the subtraction cloning was effectively conducted (J. Exp. Med., 
188:451-463, 1998; Immunity, 5:319-330, 1996X 

20 [00216] Example 3: Expression of mRNA for a novel protein 23C9 in mouse B cell 
clone CH12F3-2 

[002 1 7] The degree of enhanced expression of the g ene coding for a novel protein 
23C9 in mouse B cell clone CH12F3-2 stimulated with IL-4, TGF-p and CD40L was 
analyzed according to the standard method (L. Sambrook, E. F., Tom Maniatis., Second 
25 edition, Ed. Molecular Cloning (Nolan, C, Ed.), Cold Spring Harbour, 1989) by Northern 
blotting. 

[002 1 8] Mouse B cell clone CH 1 2F3 -2 was cultured in the presence of one of the 
following regents for 12 hours. 

(1) IL-4, TGF-P and CD40L only. 
30 (2) Cycloheximide which is a protein synthesis inhibitor (200 ng/ml), only 

(3) IL-4, TGF-P and CD40L as well as Cycloheximide (200 ng/ml) 
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[00219] Northern blotting was conducted against mRNA (10 jag for each group) 
obtained in the same manner as the-previously reported (Nucleic Acid Res., 26:911-918, 
1998) from each group of treated cells using a radio-labeled cDNA fragment (1,020 bp) 
5 coding - for a novel protein 23C9, obtained in the above Example, according to the standard 
method (L. Sambrook, E. E, Tom Maniatis., Second edition, Ed. Molecular Cloning (Nolan, 
C, Ed.), Cold Spring Harbour, 1989). 

[00220] As a control e xaminatio n experiment Northern blotting was conducted for 
mRNA derived from B cell clone CH12F3-2 cultured without any one of the above 

10 cytokines, or cycloheximide. 

[00221] The amount of mRNA to be electrophoresed was adjusted using the amount of 
mRNA m-for glyceraldehvde-3 -phosphate dehydrogenase (GAPDH) as an index._ DNA 
amplified by RT-PCR using_a GF primer and a^GR primer was used as a probe for 
blotting blots of ef GAPDH mRNA (Location of nucleotides: 566-1016, Genbank U5299) 

15 (Immunity, 9:1-10, 1998). 

[00222] FIGs. 3 and 4 show the results. 

[00223] The expression of mRNA for a novel protein 23C9 was extremely strong in 
mouse B cell clone CH12F3-2 stimulated with IL-4, TGF-P and CD40L, while the 
expression in unstimulated cells was extremely weak. Expression of the mRNA in the 
20 stimulated cells was inhibited by the presence of a protein synthesis inhibitor. Moreover, in 
the stimulated cells, two bands indicating the expression of mRNAs comprising different 
nucleotide lengths were detected. 

[00224] Expression of mRNA for a novel protein 23C9 in each of the following mouse 
cell lines,, which do not originally compris e undergo class switch recombination^ was 
25 examined by Northern blotting in the same manner as in the above. 

[00225] B cell lines (lyD9, BA/F3, 70Z/3, WEHI23 IHlT cell lines (EL-4, 2B4H; 
myeloma cell lines (X63, HEHI-3V); Fibroblast fibroblast lines (L929, NIH3T3,); the-and 
other cell lines (F2, P815, ST2). 

[00226] The expression of mRNA for the novel protein 23C9 was not observed in any 
30 of these cells. 
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[00227] Example 4: Cloning of a full length cDNA encoding a novel protein 23C9. 
[00228] Four different positive clones were obtained by screening a_cDNA library 
(Nucleic Acids Res., 26:911-918, 1998) prepared from mouse B cell clone CH12F3-2 
stimulated with IL-4, TGF-P, and CD40L, using a cDNA fragment (1,020 bp) encoding the 
5 novel protein 23C9, obtained in the above Example^ as a probe. A rThe nucleotide sequence 
of each clone was determined by using a DNA sequencer according to the standard method. 
[00229] One clone comprises a 1.2 kb nucleotide length and a single reading frame 
(©PFORF) with 1 polyadenylation site. The other 3 clones comprise a 2.4 kb nucleotide 
length and 2 polyadenylation sites. Ar The nucleotide sequence m-of_ the first 1.2 kb paf^at 
10 the 5' sid e in end of the latter clones was identical to that of the nucleotides m-of the 1.2 kb 
DNA in the former (SEQ ID NO:l). 

[00230] Two different mRNA transcripts detected in Northern blotting in the above 
Example (FIGs. 3 and 4) were predicted to correspond to transcripts for each of the above 1.2 
kb and 2.4 kb, transcribed ttsmg-untilthe polyA site at the 3' end and the polyA site at the 
15 5 'end. 

[00231] A cDNA fragment encoding the novel protein 23C9 used as a probe in the 
above (1,020 bp) was found to be- have a nucleotide sequence of from 847 to 1866 in the full 
length cDNAof23C9. 

[00232] A nucleotide sequence near a first initiation codon in each cDNA was fit to 
20 Kozak's rule (Nucleic Acids Res., 15:8125-8148, 1987). In the 2.4kb cDNA, ATTTA, which 
is a motif capable of mediating quick degradation of mRNA (Blood, 83:3182-3187, 1994^ 
was present in the 2 sites in the Huntranslatedienal regio n in th e 3 'side . 
[00233] An open reading frame (ORF) of cDNA encoding_a-the novel protein 23C9 
consisted of 198 amino acids with the expected molecular weight of about 24_kDa (SEQ ID 
25 NO:2)t-}^As a result of homology searching with known proteins bv- in a database, it was 
discovered that the an amino acid sequence e £of the ORF of the novel protein 23C9 
comprised 34% amino acid homology with the apolipoprotein B mRNA editing enzyme, 
catalytic polypeptide-1 (APOBEC-1) (Science, 260:1816-1819, 1993; J. Biol. Chem., 
268:20709-20712. 1993). GenBank and EMBL were used as the DNA data bases. SwissPlot 
30 was used as the protein database. .The BLAST program (J. Mol. Biol., 215:403-410, 1990) 
andjhe FASTA program (Proc. Natl. Acad. Sci. USA., 85:2444-2448 1988) were used ferto 
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search the databases-search. 

FIG. 5 shows an amino acid sequence o f the ORF of the novel protein 23 C9 and an 
alignment between the sequence and that of mouse APOBEC-1 amino acid sequence. 
[00234] As a r e sult of motif Motif searching en-online using PROSITE (Nucleic Acids 
5 Res., 11:2013-2018, 1992 ^) indicates that the APOBEC-1 -like novel protein 23C9 

comprises a_cytidine/deoxycytidine deaminase motif which is conserved in a -the amino acid 
sequences of arproteins belonging t eto the cytosine nucleoside/-nucleotide deaminase family 
which constructs ig_a large family and is an activation site of a deaminase activity. A-The 
cytosine nucleoside/nucleotide deaminase family is classified into RNA editing deaminases, 
10 cytidine/deoxycytidylate deaminases, and CMP/dCMP deaminases based on-the substrate 
specificity and homology in the activation sites (Cell, 81:187-195, 1995). 
[00235] A phylo genie phylo genetic tree was prepared based on the alignment-ameng 
of regions in APOBEC-1 a which is an RNA editing deaminases, cytosine nucleoside 
deaminase, cytosine nucleotide deaminases, and the cytidine deaminase motif in the novel 
15 protein 23C9. _The sequences m-of the known proteins used for the comparison were 
obtained from GenBank, as follows. 

Human derived nucleoside deaminase: L27943 

Mouse derived nucleoside deaminase: AA3 88666 

S. subtilis derived nucleoside deaminase: U18532 
20 E. coli derived cytidine deaminase: X63 144 

Rabbit derived APOBEC-1: Ul 0695 

Human derived APOBEC-1 : L25877 

Rat derived APOBEC-1 : U10695 

Mouse derived APOBEC-1: U21951 
25 T2/T4 phage derived nucleotide deaminase: JOS 1 72 

Human derived nucleotide deaminase: L12136 

S. cerevisiae derived nucleotide deaminase: U10397 

[00236] FIG 6 shows the result. Cytidin e The cytidine deaminase motif in the novel 
30 protein 23C9 was rath e r r e lativo more closely related to a subgroup of RNA editing 
deaminases than to subgroups of nucleoside deaminase and nucleotide deaminase. 
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[00237] On the other hand, a leucine-rich region existing at the C-terminus of 
APOBEC-1 is thought to be important for protein-protein interaction (Proc. Natl. Acad. Sci. 
USA., 91:8522-8526, 1994; J. Biol. Chem., 269:21725-21734, 1994). The novel protein 
23C9 also comprised a leucine-rich region at the C-terminus. Four leucines in the region of 
5 23C9 were conserved in the leucine rich regions of APOBEC-1 in rabbit, rat, mouse and 
human. 

[00238] It has been known that Phe66, Phe87, His61, Glu63 and Cys93 are essential 
for binding of APOBEC-1 to RNA, and all these amino acid residues were conserved in the 
primary structure of 23C9 (Trends Genet., 12:418-424, 1996; Cell, 81:187-195, 1995; J. Biol. 

10 Chem., 270:14768-14775 1995; J. Biol. Chem., 270:14762-14767, 1995). From this fact, 
23C9 protein is predicted to comprise an RNA editing deaminase activity. 
[00239] Moreover, cytidine deaminases derived from APOBEC-1 and E. coli 
(ECCDA) are known to comprise a pseudoactive site domain at the C-terminus., and the 
23C9 protein also compris e d comprises a pseudoactive site domain , the same as js in the 

15 APOBEC-1. This indicates that 23C9 protein is more closely related r etative-to APOBEC-1 
and ECCDA than to deaminase proteins in the other groups. 

[00240] From these facts, the novel protein 23C9 was named activation-induced 
cytidine deaminase (AID). The novel protein 23 C9 was called AID hereafter. 

20 [00241 ] Example 5 : Preparation of the AID-GST fusion protein 

[00242] The cDNA coding a-the full length AID cloned in the above Example was 
amplified by PCR with a pair of primers, AID-138 (SEQ ID NO:3) and AID-161 (SEQ ID 
NO:4), a pair of primers, AID-118 (SEQ ID NO:5) and AID-119 (SEQ ID NO:6) , using and 
Taq Polymerase b^following the standard method. As there is an intron between AID-118 

25 and AID- 1 1 9, a PCR product derived from AID genomic DNA can be easily distinguished. 
[00243] The obtained PCR product was subcloned te- into the p GEX4Tl vector 
(Pharmacia) according to the standard method. _A nucleotide sequence of the vector was 
determined and the absence of point mutations derived from the use of Taq polymerase in the 
full length AID cDNA cloned te -into the vector was confirmed. 

30 [00244] E. coli DH5a was transformed with the vector according to the standard 
method. The obtained transformants were cultured, and a full length AID cDNA was 
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expressed as a fusion protein with glutathione S-transferase (GST). The AID-GST fusion 
protein was extracted in the same manner as in the previous report, and purified using 
glutathione agarose affinity chromatography (J. Biol. Chem., 270:14768-14775 1995). 
[00245] A rThe molecular weight of the purified AID-GST fusion protein was analyzed 
5 by following the standard method using 10% SDS-PAGE and silver staining. A protein 
extracted from wild type E. coli DH5a was used as a control. FIG 7 shows the results. 
[00246] As expected, the fusion protein was detected as a band comprising a 
molecular weight of about 49 kDa. Minor bands detected under the-about 49 kDa were 
thought to be decomposed proteins, frequently generated in the purification process in 
10 general. 

[00247] A molecular weight of the purified AID-GST fusion protein was analyzed by 
the Western blotting methed-according to the standard method (Genomics, 54:89-98, 1998). 
Anti-AK) protein antibody to be used for the assay was prepared by immunizing a 
commercial rabbit for th e e xp e riment _with multiple antigen peptides including synthetic 
15 peptides corresponding to amino acids N QNos.t 1 16 to 132 of the AID protein of the present 
invention (Proc. Natl. Acad. Sci. USA., 85:5409, 1988). 
[00248] FIG 8 shows the results. 

[00249] Example 6: Cytidine deaminase activity of the AID protein 
20 [00250] A cytidine deaminase activity of AID was measured by the same method as in 
the previous report (J Biol. Chem. 270:14768-14775, 1995). 

[00251] The purified AID-GST fusion protein prepared in the above (2, 4, 6, 8, 10, 20, 
40, 60, 100, 200, 300, 400, and 600 ng) was incubated in the buffer (pH 7.5, the total amount 
10 jul) containing 45mM Tris with 3.3 jaCi [ 3 H] deoxycytidine (24.8 Ci/mmol, DuPont) and 

25 250 \jM cytidine for 2 to 4 hours. The reaction was terminated by adding deoxycytidine (2 [il 
of 10 |ug/ml) and deoxyuridine (2 jal of 10 (ig/ml). Insoluble substances were removed by 
centrifugation, and the reaction mixture (4 jal) was subjected to the polyethylene imine- 
cellulose thin layer chromatography plate (VWR). The plate was developed in isopropyl 
alcohol-/-10% HC1 (7:2 v/v). The plate was exposed to ultraviolet light (254 nm) for 

30 visualization and bands corresponding to deoxycytidine and deoxyuridine were collected, 
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and added to Ultima Gold scintillation solution to be quantified by liquid scintillation 
photometer (Packard) 

[00252] FIG 9 shows the results. As a r e sult The ^ _ALD protein showed the-a 
concentration-dependent cvtidine deaminase activit y d e p e nding on th e conc e ntrations . 
5 [00253] An-The inhibitory effect of tetrahydrouridine (THU; 0 to 40 |iM) 

(Calbiochem, US A) A which is an inhibitor specific to cytidine deaminase, on the cytidine 
deaminase activity in the AID-GST fusion protein (300 ng) was measured by the same 
method described above. 

[00254] FIG 10 shows the results. The cytidine deaminase activity of ATP protein was 

1 0 inhibited dep e nd e ncy depending on the concentrations of THU. 

[00255] Baeh -The inhibitory effect of 1 , 1 0-ophenanthroline (0 to 20 mMX which is a 
zinc-chelating agent, and its inactive form from - isomer 1 ,7-o-phenanthroline (0 to 20 mM^ 
on the cytidine deaminase activity mof the AID-GST fusion protein was measured in the 
same manner as described below. 

1 5 [00256] FIG 1 1 shows the results. The cytidine deaminase activity of AID protein was 
inhibited by 20 mM 1,10-o-phenanthroline by about 91%. _1 ,7-o-phenanthroline a which is the 
inactive isomer^ only inhibited about 13%. These results indicate that the AID protein is a 
zinc-dependent cytidine deaminase, sam e as similar to APOBEC-1 . 

20 [00257] Example 7: Avidity of AID protein with AU-rich RNA 

[00258] A recombinant APOBEC-1 binds to AuAU-rich RNA (Trends Genet., 12:418- 
424, 1996; Cell, 81:187-195, 1995; J. Biol. Chem., 270:14768-14775, 1995; J. Biol. Chem, 
270:14762-14767, 1995), and progresses RNA editing for apoB in the presence of chicken 
extract including co-factor. 

25 [00259] Since the AID protein has a .functional cytidine deaminase activity as well as a 
structural similarity with-to_APOBEC-l, to examine aa-RNA editing activity in the ATP 
protein, avidity to AU-rich RNA (5-AU) and apoB RNA (which are RNA substrates for 
APOBEC-1} was examined. 

[00260] The AID protein did not show avidity to AU-rich RNA (5-AU) in the gel 
30 retardation assay. In an in vitro apoB RNA assay, conversion from cytidine (C) to uridine (U) 
was not observed. 
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[00261] Example 8: Expression distribution of AID mRNA in tissues 
[00262] The expression of AH) mRNA in each tissue was examined by Northern 
blotting according to the standard method (L. Sambrook, E. F., Tom Maniatis., Second 
5 edition, Ed. Molecular Cloning (Nolan C, Ed.), Cold Spring Harbour, 1989; Experimental 
Medicine, Supply, "Genetic Engineering Hand Book", published by Yodosha, p. 133-140, 
1992). 

[00263] PolyA + RNA (2 \ig each) obtained from cells derived from each tissue in mice 
(muscle, spleen, lung, heart, lymph node, brain, kidney, thymus, testis, liver) according to the 
10 previous report (Nucleic Acids Res., 26:911-918, 1998) was used as a sample. Radiolabeled 
cDNA fragments (1,020 bp) encoding AID (23C9) obtained in the previous Examples was 
used as a probe for blotting polyA + RNA. 

[00264] As a control, mRNA of glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) 
was blotted in the same manner., AS-As a probe for blotting GAPDH mRNA, DNA amplified 
15 by PCR using GP primer and GR primer was used. (Nucleotide location: 566-1016, 
Genbank, U52599) (Immunity, 9:1-10, 1988). 
[00265] FIG 12 shows the results. 

[00266] As-a The result s show thaU AID mRNA was strongly expressed in mesenteric 
lymph node. In addition, weak expression was observed in spleen. 

20 

[00267] Example 9: Expression of AID mRNA in various lymphatic tissues. 
[00268] The expression of AID mRNA in each lymphatic tissue was analyzed by RT- 
PCR according to the standard method (Immunity, 9:1-10, 1998). 
[00269] cDNA was prepared according to the standard method using polyA + RNA 

25 obtained from cells derived from various lymphatic tissues (Payer's patch, mesenteric lymph 
node, axillary lymph node, spleen, bone marrow, thymus) in the same manner as in the 
previous report (Nucleic Acids Res., 26:911-918, 1998), for mRNA as a sample, as a 
template. AID cDNA and GAPDH cDNA were amplified using the obtained cDNA as a 
template. The -A first p air of primers, AID-138 (SEQ ID NO:3) and AID-161 (SEQ ID NO. 

30 4) in-theas above, a second p air of primers AID-1 1 8 (SEQ ID NO:5) and AID-1 19 (SEQ ID 
NO:6X and Taq polymerase were used for PCR of AID cDNA. As there is an intron between 
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AID-1 18 and AID- 1 19, a PCR product derived from the AID genomic DNA sequence can be 
easily distinguished. 

[00270] FIG 13 shows the results. 

[00271] AID cDNA was detected in all lymphatic tissues except for thymus. In 
5 particular, the-obvious expression was observed in peripheral lymphatic organs, such as 

lymph node or Payer's patch. On the other hand, the-expression in primary lymphatic organs 
was weak in comparison with that in the peripheral lymphatic organs. 

[00272] Example 10: Expression of AID mRNA as -over time go e s in activated mouse 

10 B cell clone CH12F3-2 

[00273] Expression of AID mRNA as -over time goes in activated mouse B cell clone 
CH12F3-2 stimulated with IL-4, TGF-p, and CD40L for 0 to 60 hours was analyzed by 
Northern blotting according to the standard method (L. Sambrook, E. F., Tom Maniatis., 
Second edition Ed. Molecular Cloning (Nolan, C, Ed.>7 Cold Spring Harbor, 1989). 

15 [00274] Mouse B cell clone CH12F3-2 was cultured in the presence of IL-4, TGF-p, 
and CD40L for fee-various periods (0, 3, 5, 12, 24, 36 A 48^ or 60 hours). 
[00275] Northern blotting was conducted against mRNA (10 /ig in each group) 
obtained from each culture group in the same manner as in the previous report (Nucleic 
Acids Res., 26:911-918, 1998) using a radiolabeled cDNA fragment encoding AID (23C9) 

20 obtained in the previous Examples, as a probe, according to the standard method (L. 

Sambrook, E. F., Tom Maniatis., Second edition Ed. Molecular Cloning (Nolan, C, Ed.). 
Cold Spring Harbor, 1989). 

[00276] The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA 
of GAPDH as an index. DNA amplified by RT-PCR using a_GF primer and aj3R primer was 
25 used as a probe for blotting GAPDH mRNA (Nucleotide location: 566-1016, Genbank 
U52599) (Immunity, 9:1-10, 1998). 
[00277] FIG. 14 shows the results. 

[00278] It was shown that Tthe expression of AID mRNA in mouse B cell clone 
CH12F3-2 was too small low to be detected without the-stimulation by cytokines, but that th e 
30 expression was initiated 3 hours after the-stimulation by cytokines (described in the above), 
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was maximum 12 hours after the-stimulation (more than about 15 times), and was gradually 
decreased from 48 hours after the stimulation. 

[00279] Example 1 1 : Cytokine specificity to inducing in the induction of expression 
5 of AID mRNA in mouse B cell clone CH12F3-2. 

[00280] Cytokine specificity to-in inducing expression of AID mRNA in mouse B cell 
clone CH12F3-2 was analyzed by Northern blotting according to the standard method 
£L. Sambrook, E. E, Tom Maniatis., Second edition Ed. Molecular Cloning (Nolan, C, Ed.), 
Cold Spring Harbour, 1 989). 

10 [00281] Mouse B cell clone CH12F3-2 was cultured in the presence of various 
combinations of cytokines (one or more selected from JL-4 ± TGF-(3, and CD40-L) for 12 
hours. Northern blotting was conducted against mRNA (10 jig in each group) obtained from 
each culture group in the same manner as in the previous report (Nucleic Acids Res., 26:911- 
918, 1998) using a radio-labeled cDNA fragment (1,020 bp) coding AID (23C9) obtained in 

15 the previous Example, according to the standard method (L. Sambrook, E. E, Tom Maniatis., 
Second e dition Ed. Molecular Cloning (Nolan, C, Ed.). Cold Spring Harbor, 1989). 
[00282] The amount of mRNA to be gel-electrophoresed was adjusted by-using mRNA 
of GAPDH as an index. _DNA amplified by RT-PCR using a_GF primer and a_GR primer was 
used as a probe for blotting GAPDH mRNA (Nucleotide location: 566-1016, Genbank 

20 U52599) (Immunity, 9:1-10, 1998). 

[00283] FIG 15 shows the results. 

[00284] Expr e ssion i lnduction of AED-mRNA expression was small by solely any one 
kind of cytokines was small . _On the other hand, when 3 kinds of cytokines as_described 
above were used at the same time, th e maximum expression maximal induction of AID- 
25 mRNA expression was observed. 

[00285] As described in the above Example 3, because e xpression induction of AID 
mRNA expression was inhibited by cycloheximidei which is an inhibitor fer-of protein 
synthesis, it is hypothesized that enhanced expression of AID mRNA needs- requires de novo 
protein synthesis. 



66 



[00286] Example 12: Expr e ssion i lnduction of AID mRNA Eexpression in spleen B 
cells by stimulation 

[00287] The presence or absence of expression induction of AID mRNA by -after 
stimulation which may activate B cells and induce class switch recombination of 
5 immunoglobulin was examined. 

[00288] Spleen B cells was were obtained p urified and obtain e d from BALB/c mouse 
(6 to 12-week old, Shimizu Experimental Materials (SLC)) and purified according to the 
standard method. _Dead cells and cell fragments were removed by Ficoll density gradient 
centrifugation after the process of removing T cells. The purified spleen B cell s were-was 
10 cultured for 4 days in the presence of a stimulus in various combinations (one or more 

selected from IL-4, TGF-p, CD40L i and LPS (lipopolysaccharide) in the same manner as in 
the previous report (Nucleic Acids Res., 26:911-918, 1998). LPS derived from Salmonella 
typhosa (50 jig/ml, Sigma) was used. 

[00289] Northern blotting was conducted against mRNA (15 \ig in each group) 
15 obtained from each culture group in the same manner as in the previous report (Nucleic 
Acids Res., 26:911-918, 1998) using a radio-labeled cDNA fragment encoding AID (23C9) 
obtained in the previous Example, according to the standard method (L. Sambrook, E. R, 
Tom Maniatis., Second edition Ed. Molecular Cloning (Nolan, C, Ed.) T Cold Spring Harbor, 
1989). 

20 [00290] The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA 
of GAPDH and 28S ribosomal RNA as an index. _DNA amplified by RT-PCR using a_GF 
primer and a^GR primer was used as a probe for blotting GAPDH mRNA (Nucleotide 
location: 566-1016, Genbank U52599) (Immunity, 9:1-10, 1998). 
[00291] FIG 1 6 shows the results. 

25 [00292] The enhanced expression of AID mRNA by the-stimulation with LPS on^ er 
LPS-iTL-4, or LPS+TGF-p was observed in normal mouse spleen B cells. 

[00293] Example 13: Induced expression of AID mRNA in vivo 
[00294] It was examined whether the e xpression induction of AID mRNA expression 

30 by various stimulations in vitro would also occur in vivo. 
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[00295] BALB/c mease- mice (6 to 12-week old, five individuals in each group, SLC) 
was -were immunized by intraperitoneally administering sheep red blood cell (SRBC) (IX 
10 8 cells, Cosmo Bio.). _In the living body immunized by SRBC, it has been known that 
clonal expansion and germinal center formation occur after the immunoresponse. and class 
5 switch recombination of an immunoglobulin gene and affinity maturation are caused. 
[00296] PolyA RNA was prepared from splenocytes isolated from spleen excised 
from each individual before (day 0) and after (day 2, 5 and 13) the immunization. 
[00297] The Polv ApolvA + -RNA (2 |ug each) was subjected to the-Northern blotting 
using the radiolabeled cDNA fragment (1,020 bp) encoding AID (23C9) as a probe in the 
10 same manner as the above Examples. _The amount of mRNA to be gel-electrophoresed was 
adjusted using mRNA of GAPDH as an index in the same manner as in the above Examples. 
[00298] FIG 17 shows the results. 

[00299] The minimum amount of expressed AID mRNA was detected before 
immunization of SRBC (day 0), however, a significant enhancement of expression (about 4 

15 to 5 times) was observed day 5 and day 13 after the immunization. 

[00300] Moreover, to analyze in which cell type enhanced expression of AID mRNA 
occurs, RT-PCR was conducted by the standard method (Immunity, 9:1-10, 1998). 
[00301] Red blood cells were removed from splenocytes obtained from spleen which 
was obtained 5 days after the immunization of SRBC in the same manner as the above, and 

20 T cells and non--T cells were separated using nylon fiber (Wako Pure Chemicals) in the same 
manner as in the previous report (Eur. J. Immunol., 3:645-649, 1973). T cell fractions 
contained more than 90% of CD3 positive cells, and less than 20% B 220 positive cells. 
[00302] T c e ll fraction (aft e rr e moval of B c e lls) and B cell fractions w e re 
conc e ntratod Concentration of T cell fractions (removal of B cells) and concentration of B cell 

25 fractions were carried out by MACS method with magnetic beads conjugated to anti-CD 19 
antibody (Miltenyi Biotech.). B220 positive B cells included in the fraction in which T cells 
were removed were 5% or less. On the other hand, B220 positive B cells included in the 
fraction in which CD 19 positive cells were concentrated were 60% or more. 
[00303] cDNA was prepared by reverse transcriptase according to the standard method 

30 using polyA + RNA prepared from each fractionated cell group. _AID cDNA and GAPDH 
cDNA were amplified by PCR using the obtained cDNA as a template. For PCR of AID 
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cDNA, the previously described pair of primers, AID-138 (SEQ ED NO;3) and AID-161 
(SEQ ID NO:4), and the previously described pair of primers, AID-118 (SEQ ID NO:5) and 
AID-119 (SEQ ID NO:6), as well as Taq polymerase^ were used. 
[00304] FIG. 1 8 shows the results. 
5 [00305] As a r e sult, i ln the CD 19 positive B cell fraction and non-T cell fraction, 
amplification of AID cDNA was observed.. Specifically, it was demonstrated that enhanced 
expression of AID mRNA induced by immunization by SRBC occurs in spleen CD 19 
positive B cells. 

10 [00306] Example 14: Localization of AID mRNA expression in lymphatic organs 
[00307] It was found that timing of enhanced expression of AID mRNA in spleen is 
almost consist e nt coincident with the initiation of germinal center (GC) formation after 
immunization of SRBC, from the result of the previous Examples. In this 
e xamination experiment an -the p recise localization of AID mRNA expression in lymphatic 

15 organs was analyzed using in situ hybridization. 

[00308] AID cDNA, cleaved out by digestin g the pGEX4Tl vector in which cDNA 
encoding the AID protein has been subcloned; with EcoRI and XhoI A was subcloned into 
plasmid pBluesciptSK (+) (Stratagene). The plasmid was digested with EcoRI or Xhol to 
obtain linearized plasmid DNA and transcribed into RNA using the plasmid as a template in 

20 the presence of digoxigenin-labeled rUTP (Boehringer-Mannheim) using T3 RNA 

polymerase or T7 RNA polymerase to prepare e ach of digoxigenin-labeled antisense probes 
and sense probes. 

[00309] On th e oth e r hand, fF rozen tissue slices were prepared by immobilizing with 
paraformaldehyde from each of the_spleen and pav e r's Paver's p atch in a normal mouse as a 
25 lymphatic organ sample. A normal mouse was immunized with SRBC in the same manner as 
in the above Examples, and frozen tissue slices immobilized with paraformaldehyde fiem 
spleen -were obtained from spleen 5 days after the immunization. 

[00310] Hybridization was conducted by applying the digoxigenin-labeled antisense 
AID probes or sense AID probes to each of the slides furnished with each of immobilized 
30 slices. Hybridized digoxigenin-labeled AID probe was detected using anti-digoxigenin 
antibody conjugated with alkaline phosphatase. The localization of anti-digoxigenin 
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antibody conjugated to digoxigenin on the probe was identified by detecting a phosphatase 
reactant (dark purple color). This analysis was conducted using a light transmission 
microscope. 

[0031 1] In situ hybridization and detection of riboprobes in this e xamination 
5 experiment were conducted in the same manner as in the previous report (J. Comp. Neurol., 
333:398-416, 1993). 

[00312] The location of the g erminal center in each tissue slice was identified by 
staining with PNA (Vector) conjugate with FITC and observing with a inflorescent 
microscope. 

1 0 [00313] FIGs. 19 and 20 show the results. 

[00314] In the e xamination experiments using the antisense AID probe, multiple 
obvious focal signals were observed in spleen tissue slices derived from SRBC immunized 
mouse (day 5 after the immunization) (FIGs. 19(E) and 20(E)), however, an^no signals were 
ne^detected in spleen tissue slices derived from mice which were not immunized with SRBC 

15 (FIGs. 19(B) and 20(B)). This result is consistent with the result of Northern blotting 

obtained in the above Example (FIG. 17). The ^ existence of a_germinal center was observed 
both in spleen tissue slices derived from SRBC-immunized mouse (day 5 after the 
immunization) (FIGs. 19(F) and 20(F)) as well as in the normal pav e r's Paver's p atch (FIG. 
20(1)), by staining with FITC-labeled PNA. _The expression of AID mRNA was found to 

20 localize in the g erminal center in the-both tissue slices. 

[00315] In the e xamination experiments using the sense AID probe, any signals as 
background w e r e not d e t e ct e d no background signals were detected in tissue slices of spleen 
or of pPayer's patch regardless of the presence or absence of the immunization by SRBC. 
[00316] This result indicates that the e xpr e ssion induction of AID mRNA expression 

25 occurs in-specifically te -in g erminal center B cells ? activated by stimulation with an antigen. 

[003 1 7] Example 1 5 : Isolation of human g enomic DNA encoding the AID protein 
d e riv e d from human 

15-1 Preparation of probes for hybridization 
30 [00318] PCR was conducted using an expression vector^ prepared by inserting cDNA 
encoding a full length mouse AID protein, (prepared in Example 5} into a plasmid vector 
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pGEX4Tl, as a template^ with a pair of primers (Primer 170: SEQ ID NO: 16 and primer 181: 
SEQ ID NO: 179), according to the standard method described in the above. 
[00319] The obtain e d resulting PCR product was purified by the standard methods 
described above and a nucleotide sequence of the purified DNA was determined by the-a 
5 direct s e qu e nc e sequencing method to confirm that the purified DNA is the nucleotide 
sequence encoding a full length e£mouse AID protein. This purified DNA was used as a 
probe for hybridization in the following experiments. 

15-2 Screening of human genomic DNA library 
1 0 [00320] The probe prepared in the above was labeled in the same manner as fertile 
radioactive probe in the above Northern hybridization to make a probe radio-labeled by a 
radioactive isomer. 

[00321] Using the labeled probe, a commercial human genomic DNA library 
(catalogue No. HL1067j; Lot No. 45003; CLONTECH) was screened by the-cross 

1 5 hybridization according to the standard method. 

[00322] Washing after the-hybridization was conducted twice in 2 X SSC (including 
0.1% SDS, und e r th eat room temperature, 10 min), and twice in 2 X SSC (including 0.1% 
SDS, 65°C, 30 min). Phage DNA was subcloned by purifying phage DNA and inserting 
about 22 kb genomic DNA a obtained by cleaving at NotI restriction enzyme site in the phage 

20 DNA, into the Not I restriction enzyme site in plasmid pZero-2. 1 . This plasmid was named 
3CpZero. 

[00323] A DNA fragment obtained by digesting 3CPZero with PstI was ligated to the 
PstI site of plasmid pBlueScript KS (Toyobo) and E. coli was transformed with this ligated 
DNA. 

25 [00324] Transformants were screened by the colony hybridization using the labeled 
probe prepared in the above according to the standard method, and multiple positive clones 
were obtained. 

[00325] Ar The nucleotide sequence of human genomic DNA inserted into each 
positive clone was analyzed and multiple clones containing genomic DNA of DNA encoding 
30 ajiuman AID protein were identified. 
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[00326] Among the multiple clones, nucleotide sequences of genomic DNA containing 
DNA encoding ahuman AID protein contained in two clones were-are described in SEQ ID 
NOs:9 and 10, respectively. 

[00327] Moreover ln addition , arthe nucleotide sequence of genomic DNA including 
5 the DNA encoding ahuman AID protein included in the positiv e other positive clone was-is 
shown in SEQIDNO:35. 

[00328] Example 16: Isolation of cDNA encoding a full-length human AID protein 

and preparation of human AID protein 
1 0 [00329] By comparing ar-the nucleotide sequence of genomic DNA,, including arthe 

coding region of the obtain e d human AID protein^ with cDNA nucleotide sequence encoding 

a-the full-length mouse AID protein (determined in the examples above), a human AID 

protein encoding region in the human genomic DNA was deduced. 

[00330] A pair of primers for RACE-PCR was designed based on the deduced 
15 nucleotide sequence of the coding region in the human AID protein (Primer 22: SEQ ID 

NO: 18, and primer 25: SEQ ID NO: 19). 

[00331] RACE-PCR was conducted using mRNA prepared from human B Lymphoma 
cell line RAMOS as a template with the above pair of primers according to tfee-aj)revious 
report (J. Biol. Chem., 274:18470-18476, 1999) by-following the standard method. ArThe 
20 nucleotide sequence of the obtain e d resulting PCR product was determined and cDNA 

encoding a full length human AID protein was obtained (cDNA sequence: SEQ ID NO:7, and 
amino acid sequence: SEQ ID NO: 8). 

[00332] As a rosult The results indicate that the r human AID protein (SEQ ID NO: 8) 
has extremely high homology in amino acid sequences with a-the mouse AID protein (SEQ 

25 ED NO:2) (FIG. 22). AThe a mino acid sequences in the Gcvtidine and deoxycytidilate 

deaminase zinc-binding region which is an active region in the AID protein (both mouse AID 
and human AID amino acid residues 56 to 94 of SEQ ID NOs: 2 and 8, respectivelv SE€MD 
NO: 5 6 to 9 4) were completely consistent (conserved) between mouse and human. 
[00333] As a-the partial amino acid sequence (amino acid NO: 1 16 to 132 in SEQ ID 

30 NO:2) of mouse AID protein used for the preparation of anti-AID protein antibody (Example 
5) was completely consistent with a-the corresponding amino acid sequence (amino acid NO: 
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1 16 to 132 in SEQ ID NO: 8) of human AID protein, the anti-AID protein antibody was 
expected to compris e cross-reactivity not only with mouse AID protein but also with human 
AID protein. 

[00334] Human AID cDNA obtained in the above was reconstructed according to the 
5 standard method in the manner of genetic engineering so that His- AID fusion protein^ add e d 
with His-tag (a peptide of histidine repeated 10 times)added at the N -termin us of al-4e-the 
human AID protein^ was produced, and an expression vector was prepared by inserting the 
cDNA into a plasmid pEF-BOS (Unexamined published Japanese patent No. Hei 2-242687). 
The vector was introduced into amonkey kidney- d e riv e d derived cell line,, COS7 a by 
10 lipofection using LIPOFECTAMINE (GEBCO BRL) according to the standard method. The 
obtained resulting transgenic cells were cultured by the-standard methods and His-human 
AID fusion protein was transiently expressed. His-human AID fusion protein was extracted 
and purified in the same method as the-previously reported, and the production of His-human 
AID fusion protein was analyzed by Western blotting with the anti-AID antibody prepared in 
15 Example 5 and a commercial anti-His tag antibody according to the standard method. _As^a 
r e sult, Tthe His-AED protein was detected as a band comprising about 31 kDa molecular 
weight in all cases using any -either antibody. 

[00335] Example 17: Determination of exons in the g enomic DNA encoding the 
human AID protein 

20 [00336] Based on the information for the nucleotide sequence o f the cDNA encoding 
the full length human AID protein above, exons in the nucleotide sequences for genomic 
DNA encoding the human AID protein in the above were determined. 
[00337] As a result, it -the genomic DNA was confirmed to consist of 5 exons. 
Exon 1 (Nucleotide sequence: SEQ ID NO: 11); 
25 Exon 2 (Nucleotide sequence: SEQ ID NO: 12); 

Exon 3 (Nucleotide sequence: SEQ ID NO: 13); 
Exon 4 (Nucleotide sequence: SEQ ID NO: 14); and 
Exon 5 (Nucleotide sequence: SEQ ID NO: 15). 
[00338] The-eExon 1 contains a translation initiation codon ATG which codes for the 
30 first methionine (Amino acid No: 1 of SEQ ID NO: 8) i n the human AID protein ? ;-and the 
initiation codon corresponds to nucleotides NOs: 80 to 82 in SEQ NO: 11. 
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[00339] Specifically, the genomic DNA including DNA encoding the human AID 
obtained in the above Examples (SEQ ID NO:9, SEQ ID NO: 10 and SEQ ID NO:35) 
consists of introns and exons as described below and comprises a full length of about 11 kb. 
FIG 23 schematically shows the structure. 
<SEQ ED NO:9> 

Intron: Nucleotide NoOs^t from 1 to 1031 

Exon 1 : Nucleotide NoOs^ from 1 032 to 1 1 1 8 

Intron: Nucleotide No©s..t from 1119 to 5514 

<SEQIDNO:10> 

Intron: Nucleotide NoOs^ from 1 to 1064 
Exon 2: Nucleotide NoOs.t from 1065 to 1212 
Intron: Nucleotide NoOs.t from 1 2 1 3 to 259 1 
Exon 3: Nucleotide NoOs.t from 2592 to 2862 
Intron: Nucleotide NoOs.t from 2863 to 3 1 55 
Exon 4: Nucleotide NoOs^ from 3156 to 3271 
Intron: Nucleotide NoOs.t from 3272 to 3740 
Exon 5: Nucleotide NoOs.t from 3741 to 5912 
Intron: Nucleotide NoOs^ from 5913 to 6564 

<SEQ ID NO:35> 

Intron: Nucleotide NoOs^ from 1 to 441 
Exon 1 : Nucleotide Neos.T from 442 to 528 
Intron: Nucleotide NoOs.t from 529 to 6279 
Exon 2: Nucleotide NoOs^ from 6280 to 6427 
Intron: Nucleotide NoOs.t from 6428 to 7806 
Exon 3: Nucleotide NoOs^ from 7807 to 8077 
Intron: Nucleotide NoOs^ from 8078 to 8370 
Exon 4: Nucleotide NoOs^ from 8371 to 8486 
Intron: Nucleotide NoOsj- from 8487 to 8955 
Exon 5: Nucleotide NoOs^ from 8956 to 11067 
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Intron: Nucleotide NoOs^ from 11068 to 11204 



[00340] Example 18: Amplification of a given partial nucleotide sequence of genomic 
DNA encoding the human AID protein by PCR A and diagnosis fep-of the presence or absence 
of mutations in the partial nucleotide sequence. 

[00341 ] The AID protein of the present invention may bejnvolved in sid e ration of the 
development of various immunodeficiency and allergic diseases.. For example, a given 
immunodeficiency or allergic disease may be caused by mutation or deletion in the 
nucleotide sequence of genomic DNA (especially in an exon) encoding an ATP protein. 
[00342] The presence or absence of such amutation or deletion in the g enomic DNA 
can be analyzed by, for example, the following examples. 

(1) A pair of primers comprising a nucleotide sequence complementary to 
a given partial nucleotide sequence of genomic DNA encoding the AID protein in the 
present invention is prepared. 

(2) Using genomic DNA encoding the AID protein obtained from tissues 
or cells of a patient suffering from immunodeficiency or allergic disease as a 
template, an objective partial nucleotide sequence of the genomic DNA is amplified 
with the pair of primers-BNA. 

(3) By analyzing the presence or absence of a PCR product and a 
nucleotide sequence of the PCR product, and comparing the nucleotide sequence with 
a corresponding nucleotide sequence in genomic DNA encoding the AID protein 
derived from a normal person, amutation or deletion in the genomic DNA is 
identified. 

[00343] Specifically, this method enables, for example, not only e lucidat e elucidation 
of the relationship between immunodeficiency or allergic disease and the AID protein, but 
also diagnosise the-of diseases by the above method in the case that AID protein is a cause of 
sid e ration development of a given type of disease (for example^ immunodeficiency or allergic 
disease). 

[00344] For the above purpose, the following 15 kinds of primers were designed and 
prepared based on a given partial nucleotide sequence in the genomic DNA encoding the 
human AID protein. 
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Primer: p3 (SEQ ID Ner-NO:20) 

Primer: p9 (SEQ ID Ner-N0:21) 

Primer: plO (SEQ ID Ne^-NO:22) 

Primer: pl2 (SEQ ID Ner-NO:23) 
5 Primer: pi 4 (SEQ ID Ner-NO:24) 

Primer: pi 6 (SEQ ID Ner-NO:25) 

Primer: pi 7 (SEQ ID Ner-NO:26) 

Primer: pi 9 (SEQ ID Ne^NO:27) 

Primer: p26 (SEQ ID Ner-NO:28) 
1 0 Primer: p29 (SEQ ID Ne^NO:29) 

Primer: p36 (SEQ ID Ne^-NO:30) 

Primer: p48 (SEQ ID Ne^NO:31) 

Primer: p59 (SEQ ID Ner-NO:32) 

Primer: p85 (SEQ ID Nev-NO:33) 
1 5 Primer: p86 (SEQ ID Ner-NO:34) 

[00345] By PCR using the above primers as-ina pairs in of prim e rs by the following 
combinations, and a genomic DNA isolated from human B lymphoma cell RAMOS as a 
template, a partial nucleotide sequence coding each target human AID protein was amplified. 
20 FIG. 21 shows the relative locations of genomic DNA partial nucleotide sequences amplified 
by each primer pair. 

(1) DNA comprising the nucleotide sequence of SEQ ID NO:3 1 and DNA 
comprising the nucleotide sequence of SEQ ID NO:32; 

(2) DNA comprising the nucleotide sequence of SEQ ID NO:20 and DNA 
25 comprising the n ucleotide sequence of SEQ ID NO:22; 

(3) DNA comprisin g the nucleotide sequence of SEQ ID NO:21 and DNA 
comprising the nucleotide sequence of SEQ ID NO:30; 

(4) DNA comprisin g the nucleotide sequence of SEQ ID NO:24 and DNA 
comprising the nucleotide sequence of SEQ ID NO:25; 

30 (5) DNA comprising the nucleotide sequence of SEQ ID NO:23 and DNA 

comprising the nucleotide sequence of SEQ ID NO:27; 
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(6) DNA comprisin g the nucleotide sequence of SEQ ED NO:23 and DNA 
comprising the nucleotide sequence of SEQ ED NO:28; 

(7) DNA comprising the nucleotide sequence of SEQ ED NO:23 and DNA 
comprising the nucleotide sequence of SEQ ED NO:29; 

(8) DNA comprising the nucleotide sequence of SEQ ED NO:26 and DNA 
comprising the nucleotide sequence of SEQ ED NO:27; 

(9) DNA comprising the nucleotide sequence of SEQ ED NO:26 and DNA 
comprising the nucleotide sequence of SEQ ID NO:28; 

( 1 0) DNA comprising the nucleotide sequence of SEQ ID NO:26 and DNA 
comprisin g the nucleotide sequence of SEQ ID NO:29; 

(11) DNA comprising the nucleotide sequence of SEQ ED NO:34 and DNA 
comprising the nucleotide sequence of SEQ ID NO:28; 

(12) DNA comprising the nucleotide sequence of SEQ ED NO:34 and DNA 
comprising the nucleotide sequence of SEQ ID NO:29; 

(13) DNA comprising the nucleotide sequence of SEQ ED NO:33 and DNA 
comprising the nucleotide sequence of SEQ ID NO:29; or, 

( 1 4) DNA comprising the nucleotide sequence of SEQ ED NO : 1 8 and DNA 
comprisin g the nucleotide sequence of SEQ ID NO: 19; 

[00346] The condition for PCR was set by the following manner. 
[00347] <Reaction solution> 

[00348] A total amount of 20.2 pi consisting of DDW (8 jil), 10 X buffer (2 pi), dNTP 
(2.5 mM each, 2 jil), 2 mM primer 1 (2 jul), 2 jiM primer 2 (2 pi), genomic DNA isolated 
from human B Lymphoma cells (185 ng/pl) and Taq polymerase (5 U/ml, 0.2 jil), Ex Taq 
(TAKARA), or Ampli Taq (Perkin Elmer). 

[00349] <Reaction> 

[00350] Reaction was conducted by -according to the following conditions (A) or (B). 

(A) Conducting 1 cycle_of ([reaction at 94°C for 30 sec)] and 40 cycles of 
([reaction at 94°C for 10 sec, reaction at 54°C for 30 sec, and reaction at 72°C for 3 
min and 30 sec)^ and subsequently stored at 4°C 
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(B) Conducting 1 cycle of ([reaction at 94°C for 30 sec)} and 40 cycles of 
[(reaction at 94°C for 10 sec, reaction at 55°C for 30 sec, and reaction at 72°C for 2 
min and 10 sec]^ and subsequently stored at 4°C. 



5 [00351] <PCR equipment 

[00352] A commercial PCR device (Perkin Elmer Thermal Cycler 9700 type) was 
used. 

[00353] Example 19: The expression of human AID mRNA in various human organ 
10 tissues 

[00354] The expression of human AID mRNA in various human organ tissues was 
analyzed by RT-PCR according to the standard method (Immunity, 9:1-10, 1998). 
[00355] RT-PCR was conducted bousing various tissues se^in the-a_human tissue 
cDNA panel (CLONTECH) as a template according to the-standard methods. 
15 [00356] AID cDNA was amplified by -using p rimers pl7 (SEQ ID NO:26) and p26 
(SEQ ID NO:28) prepared in th eas above A and Taq polymerase. 

[00357] As a control, RT-PCR was conducted in the same manner was conducted 
using cDNA of G3PDH as a template and GF primer and GR primer (Immunity, 9:1-10, 
1998). 

20 [00358] FIG. 24 shows the results. As a result, Specific expression of mRNA was 
confirmed in lymph node and tonsil. This result was consistent with the experimental result 
in which the expression of mRNA for mouse AID was observed in the various lymphatic 
tissues (Examples 8 and 9). 

[00359] On the other hand, when RT-PCR was conducted with the-a_saturated cycle 
25 number in the same manner as the above in the above RT-PCR, the expression of AID mRNA 
was observed in almost all analyzed organs. 



[00360] Example 20: Localization of human AID gene on human chromosomes 
[00361] Localization of the human AID gene on human chromosomes was analyzed 
30 by Fluoresc e nc e fluorescence in situ hybridization (FISH) method (Experimental Medicine, 
Suppl. "Genetic Engineering Hand Book" published by Yodosha, 1992, p. 271-277). 
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[00362] Genomic DNA including human AID gene (exon 1 to exon 5), isolated in the 
above, which was labeled with biotin-1 1-dUTP (Sigma) by the nick translation method 4 was 
used as a probe for hybridization. 

[00363] The probe was hybridized with chromosomes in metaphase human cells. 
5 Hybridization signals were detected using fluorescein isothiocyanate-avidin (DCS) (Vector 
Laboratories). 

[00364] FIG 25 shows the results. As a r e sult The human AID gene was found to be 
localized on chromosome 12pl3. Mor e ov e r, t This location was r e v e al e d to be is near 
12pl3.1 which is the location for th e abov e APOBEC-1 comprising which has a relatively 
10 high amino acid sequence homology with the AID protein^ and has the same cytidine 
deaminase activity as that of the AID protein. 

[00365] It has been reported that some abnormality on human chromosome locus 
12pl3.3-12pll.2, 12pl3.2-12p24.1 and 12pl3 may be involved in diseases such as 
acrocallosal syndrome, inflammatory bowel syndrome,, familial periodic fever, respectively, 
15 however, the causative gene thereof has not been traced yet. It has b ee n sugg e st e d that The 
human AID gene of the present invention may be involved in sid e ration the development of 
such diseases. 

[00366] All references and patents cited herein are incorporated by reference in their 
entirety. 

20 

INDUSTRIAL APPLICABILITY 
[00367] The AID protein of the present invention can be considered to have a function 
of regulating various biological mechanisms required for generation of antigen-specific 
immunoglobulins (specific antibodies), which eliminate non-self antigen (foreign antigen, 

25 self-reacting cells, etc.) that triggers various diseases. More specifically, the AID protein of 
the present invention can be considered to be one of the enzymes that plays an important role 
in the-genetic editing such as RNA editing,, and so on occurring in germinal center B cells, 
such as the_activation of B cells, class switch recombination of immunoglobulin genes, 
somatic hypermutation, and affinity maturation, which are the mechanisms for generation of 

30 immunoglobulins having high specificity te-for antigens. 
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[00368] The dysfunction of the AID protein of the present invention can be the cause 
fei^heof humoral immunodeficiency since it induces failure of germinal center B cell 
function, such as antigen-specific B cell activation, class switch recombination, and somatic 
mutation. R e v e rs e l y Converselv . the breakdown of the regulation of AID protein may induce 
5 all e rgy allergic diseases or autoimmune diseases since it can cause inappropriate B cell 
activation and needless class switch recombination and somatic mutation. 
[00369] Therefore, regulation of the function of AID protein and the gene encoding it 
enables preventiong and treat ment mg of various immunodeficiencies, autoimmune diseases, 
and allergies, which result from, for example, B cell dysfunctions (e.g. IgA deficiency, IgA 

10 nephropathy, y globulinemia, hyper IgM syndrome, etc.) or class switch deficiency of 
immunoglobulin. Thus, the AID protein and the gene encoding the AID protein can be 
targets for the development of drugs for therapy of the diseases mentioned above. 
[00370] Examples of diseases whose onset prevention, symptom remission, therapy 
and/or symptomatic treatment effect is expected by regulating the function of the AID protein 

15 of the present invention or the gene encoding it include, for example, primary 

immunodeficiency syndrome with congenital disorder of immune system, mainly 
immunodeficiencyies considered to develop by B cell deficiency, decrease, or dysfunction 
(e.g. sex-linked agammaglobulinemia, sex-linked agammaglobulinemia with growth 
hormone deficiency, immunoglobulin deficiency with high IgM levels, selective IgM 

20 deficiency, selective IgE deficiency, immunoglobulin heavy chain gene deletion, k chain 
deficiency, IgA deficiency, IgG subclass selective deficiency, CVID (common variable 
immunodeficiency), infantile transient dysgammaglobulinemia, Rosen syndrome, severe 
combined immunodeficiency (sex-linked, autosomal recessive), ADA (adenosine deaminase) 
deficiency, PNP (purine nucleoside phosphorylase) deficiency, MHC class II deficiency, 

25 reticular dysplasia, Wiskott-Aldrich syndrome, ataxia telangiectasia, DiGeorge syndrome, 
chromosomal aberration, familial Ig hypermetabolism, hyper IgE syndrome, Gitlin 
syndrome, Nezelof syndrome, Good syndrome, osteodystrophy, transcobalamin syndrome, 
secretary bead syndrome, etc.), various diseases with antibody production deficiency that are 
secondary immunodeficiency syndromes with a_disorder of the immune system caused by an 

30 acquired etiology (for example, AIDS, etc.), and/or various allergic diseases (e.g., bronchial 
asthma, atopic dermatitis, conjunctivitis, allergic rhinitis, allergic enteritis, drug-induced 
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allergy, food allergy, allergic urticaria, glomerulonephritis, etc.). These could be targets for 
drug development. 

[00371] Namely, the AID protein of the present invention, a fragment thereof, a DNA 
encoding the AID protein, a fragment thereof, and an antibody against the AID protein are 
5 useful as reagents for developing drugs for prevention and therapy of such diseases. 

[00372] Also, the DNA itself is useful as an antisense drug regulating the function of 
the AID gene at a- the g ene levels and in a use in gene therapy. The protein^ or the fragments 
thereof (e.g. enzyme active site^ itself is useful as a drug. 

[00373] Furthermore, an antibody reactive to the AID protein of the present invention 
10 or a fragment thereof is extremely useful as an antibody drug by-to r e gulating regulate 
functions of the AID protein. 

[00374] Furthermore, the gene (DNA), protein, and antibody of the present invention 
are useful as reagents for searching foLsubstrates (e.g., RNA, etc.) that interacting (bindmg) 
with the protein (enzyme) of the present invention, or other auxiliary proteins associated with 
15 the protein of the present invention, and for developing drugs targeting the substrates and 
auxiliary proteins. 

[00375] Furthermore, a method for identifying a substance that regulates production of 
the AID protein of the present invention or transcription of a gene encoding the ATP protein 
into mRNA aFe-is_extremely useful as ameans to develop drugs for therapy and prevention of 
20 various diseases (especially, immunodeficiency and allergic disease^ in which the above- 
mentioned AID protein or AID gene is considered to be involved. 
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ABSTRACT 

Genes encoding novel proteins named AID (Activation-Induced cytidine Deaminase), 
haviftg- that are structurally relatedienship to APOBEC-1, on e of an RNA editing enzymes, 
and haveing a cytidine deaminase activity similar to APOBEC-1 , haves been found by 
5 preparing cDNA libraries fe ^from mouse B cell clone CH12F3-2 T (which undergoes class 
switch recombination from IgM to IgA at an extremely high rate tog e th e r with after 
activation of the cells by stimulation with cytokines), with and without stimulationng with 
cytokines, and performing subtraction cloning using the libraries. 
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